PCT 



WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




1Nra ,NAT^AL APPUCATION PUBUSHED U NDER THE PAHMT COOPERAT10NTR^^_ 



(51) International Patent Classification 6 : 

C12N 15/40, 15/54, 15/82, C07K 14/08, 
16/10, C12Q 1/68, G01N 33/53 



A2 



(11) International Publication Number: 
(43) International Publication Date: 



26 June 1997 (26.06.97) 



(21) International Application Number: PCT/US96/20747 

(22) International Filing Date: 20 December 1996 (20.12.96) 



(30) Priority Data: 

60/009,008 



21 December 1995 (21.12.95) US 



(71) Applicant: CORNELL RESEARCH FOUNDATION INC 

[US/US]; Suite 105, 20 Thornwood Drive, Ithaca, NY 14850 
(US). 

(72) Inventors: GONSALVES, Dennis; 595 Castle Street. Geneva, 
{1V ny 14456 (US). LING, Kai-Shu; 170 William Street, 

Geneva, NY 14456 (US). 

(74) Agents: FERBER, Donna. M. et al.; Greenlee Winner and 
Sullivan, Suite 201, 5370 Manhattan Circle, Boulder, CO 
80303 (US). 



(81) Designated States: AL, AM. AT, AU AZ BA BB BG, BR, 
BY. CA, CH. CN, CU, CZ, DE, DK, EE. ES FI, GB, GE, 
HU IL IS JP, KE, KG, KF, KR, KZ, LC f LK, LR, LS, 
Lt'lU, LV, MD, MG, MK, MN, MW, MX, NO, NZ PL, 
PT. RO, RU, SD, SE, SG, SI SK ( TJ, TM, TR, TT . UA. 
UG UZ, VN, ARIPO patent (KE, LS, MW, SD, SZ. UG), 
Eurasian patent (AM, AZ, BY, KG, KZ MD RU Tl TM). 
European patent (AT, BE, CH, DE, DK, ES, FI. TO. GB. 
GR IE IT LU, MC. NL, PT, SE), OAPI patent (BF, BJ, 
CF.'cG, CI, CM. GA, GN, ML, MR, NE, SN, TD, TG). 



Published , , ... , . 

Without international search report and to be republished 

upon receipt of thai report. 



(54) Title: GRAPEVINE LEAFROLL VIRUS PROTEINS AND THEIR USES 
(57) Abstract 

T* present invention relates to an iso.ated ^??*3£»™^^ 
ieafroll virus. IT* encoding DNA molecule ether % leafroll resistance to grape plants by 

isastraE ssz vst* u *. * *- — — 

the DNA molecule of the present invention is also disclosed. 



FOR THE PURPOSES OF INFORMATION ONLY 

Codes used to identify States party to the PCT on the front pages of pamphlets publishing international 

applications under the PCT. 



AM 


Armenia 


GB 


United Kingdom 


AT 


Austria 


GE 


Georgia 


AU 


Australia 


GN 


Guinea 


BB 


Barbados 


GR 


Greece 


BE 


Belgium 


HU 


Hungary 


BF 


Burkina Faso 


IE 


Ireland 


BG 


Bulgaria 


IT 


Italy 


Bj 


Benin 


JP 


Japan 


BR 


Brazil 


KE 


Kenya 


BY 


Belarus 


KG 


Kyrgystan 


CA 


Canada 


KP 


Democratic People's Republic 


CF 


Central African Republic 




of Korea 


CG 


Congo 


KR 


Republic of Korea 


CH 


Switzerland 


KZ 


Kazakhstan 


CI 


Cdte d'lvoirt 


U 


Liechtenstein 


CM 


Cameroon 


LK 


Sri Lanka 


CN 


China 


LR 


Liberia 


CS 


Czechoslovakia 


LT 


Lithuania 


CZ 


Czech Republic 


LU 


Luxembourg 


OK 


Germany 


LV 


Latvia 


DK 


Denmark 


MC 


Monaco 


EE 


Estonia 


MD 


Republic of Moldova 


E5 


Spain 


MG 


Madagascar 


Fl 


Finland 


ML 


Mali 


FR 


France 


MN 


Mongolia 


GA 


Gabon 


MR 


Mauritania 



MW 


Malawi 


MX 


Mexico 


NE 


Niger 


NL 


Netherlands 


NO 


Norway 


NZ 


New Zealand 


PL 


Poland 


PT 


Portugal 


RO 


Romania 


RU 


Russian Federation 


SD 


Sudan 


SE 


Sweden 


SG 


Singapore 


SI 


Slovenia 


5K 


Slovakia 


SN 


Senegal 


sz 


Swaziland 


TT> 


Chad 


TG 


Togo 


TJ 


Tajikistan 


TT 


Trinidad and Tobago 


UA 


Ukraine 


UG 


Uganda 


US 


United Stales of America 


uz 


Uzbekistan 


VN 


Viet Nam 



WO 97/22700 



rCT7US96/20747 



10 



15 



20 



25 



30 



35 



This work was supported by United States -Israel 
Binational Agricultural Research and Development Fund Grant 
No US-1737-89 and by the United States Department of 
Agriculture Cooperative Agreement No. 58-2349-9-01. The 
Federal Government may have certain rights in the invention. 

FIELD OP THE INVENTION 

The present invention relates to grapevine leafroll virus 
proteins, DNA molecules encoding these proteins, and their 
uses. 

BACKGROUND OF THE INVENTION 

The world's most widely grown fruit crop, the grape 
(Vitis bp.), is cultivated on all continents except 
Antarctica. Major grape production centers are in European 
countries (including Italy, Spain, and France) , which 
constitute about 70% of the world grape production (Mullms et 
al., nirl-T-nf r*-"™~*™- Cambridge, U.K., University 
Press (1992)). The United States is the eighth largest grape 
grower in the world. Although grapes have many uses, a manor 
portion of grape production (-80%) is used for wine 
production. Unlike cereal crops, most of the world's 
vineyards are planted with traditional grapevine cultivars, 
which have been perpetuated for centuries by vegetative 
propagation. Several important grapevine virus and virus-like 
diseases, such as grapevine leafroll. corky bark, and 
Rupestris stem pitting, are transmitted and spread through the 
use of infected vegetatively propagated materials. Thus, 
propagation of certified, virus-free materials is one of the 
m os"t important disease control measures. Traditional breeding 
for disease resistance is difficult due to the highly 
heterozygous nature and outcrossing behavior of grapevines, 
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and due to polygenic patterns of inheritance. Moreover, 
introduction of a new cultivar may be prohibited by custom or 
law. Recent biotechnology developments have made possible the 
introduction of special traits, such as disease resistance, 
into an established cultivar without altering its 
horticultural characteristics. 

Many plant pathogens, such as fungi, bacteria, 
phytoplasmas, viruses, and nematodes can infect grapes, and 
the resultant diseases can cause substantial losses in 
production (Pearson et al., compendium of firapft Diseases , 
American Phytopathological Society Press (1988)). Among 
these, viral diseases constitute a major hindrance to 
profitable growing of grapevines. About 34 viruses hav 2 been 
isolated and characterized from grapevines. The major virus 
diseases are grouped into: (1) the grapevine degeneration 
caused by the fanleaf nepovirus, other European nepoviruses, 
and American nepoviruses, (2) the leaf roll complex, and (3) 
the rugose wood complex (Martelli, ed. , firafr Transmiss i ble 
Pii-n^ T r * rc^vi PPs Handbo ok fry Petition and Diagnos i s . 
0 FAO, UN, Rome, Italy (1993)). 

Grapevine leafroll complex is the most widely distributed 
of the major diseases of grapes. According to Goheen (Goheen, 
-Grape Leafroll," in Frazier et al., eds., V^ l ff TUfift a aes of 
Qm»n PnHtg ro-anpvines ffl Handbook) , University of 
5 California, Division of Agricultural Sciences, Berkeley, 
Calif, USA, pp. 209-212 (1970), grapevine leafroll-like 
disease was described as early as the 1850s in German and 
French literature. The viral nature of the disease and graft 
transmission were first demonstrated by Scheu (Scheu, R^JL. 
0 WPinbau 14:222-358 (1935). In 1946, Harmon and Snyder (Harmon 
et al., snr Wort. Sci. 74:190-194 (1946)) 

determined the virus nature of White Emperor disease in 
California. It was later proven by Goheen et al . (Goheen et 
al., PVT a ^" 1ogv - 48:51-54 (1958)) that both leafroll and 
5 "White Emperor" diseases were the same, and only the name 
"leafroll" was retained. 
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Leafroll is a serious virus disease of grapes and occurs 
wherever grapes are grown. This wide distribution of the 
disease has come about through the propagation ox diseased 
vines. It affects almost all cultivated and rootstock 
varieties of Vitis. Although the disease is not lethal, it 
causes yield losses and reduced sugar content. Scheu 
estimated in 1936 that 80 per cent of all grapevines planted 
in Germany were infected (Scheu, MHn WinT i srbuch , Berlin, 
Reichsnahrstand-Verlags (1936) ) . In many California wine 
grape vineyards, the incidence of leafroll (based on a 1959 
survey of field symptoms) agrees with Scheu" s initial 
observation in German vineyards (Goheen et al . , bmr , J . End ■ 
viMc. . 10:78-84 (1959)). The current situation on leafroll 
disease appears similar (Goheen, The American 
D >, y ^ ra ^rOoc r-i™i finrietv. St. Paul, Minnesota : APS Press, 
1:47-54 (1988). Goheen estimated that the disease causes an 
annual loss of about 5-20 per cent of the total grape 
production (Goheen (1970); Goheen (1988)). The amount of 
sugar in individual berries of infected vines is only about 
1/2 to 2/3 that of berries from noninfected vines (Goheen 
(1958) ) . 

Symptoms of leafroll disease vary considerably depending 
upon the cultivar, environment, and time of the year. On red 
or dark-colored fruit varieties, the typical downward rolling 
and interveinal reddening of basal, mature leaves is prevalent 
in autumn and is less apparent in spring or early summer. On 
light -colored fruit varieties, symptoms are less conspicuous, 
usually downward rolling accompanied by interveinal chlorosis. 
Moreover, many infected rootstock cultivars do not develop 
symptoms. In these cases, the disease is usually diagnosed 
with a woody indicator indexing assay using Vitis vivifera cv. 
Carbemet Franc (Goheen (1988)). 

Ever since Scheu demonstrated that leafroll was graft 
transmissible, a virus etiology has been suspected (Scheu 
(1935)). Several virus particle types have been isolated from 
leafrcll diseased vines. These include potyvirus-like (Tanne 
et al., Phytrra-Hnlogv . 67:442-447 (1977)), isometric virus- 
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like (Castellano et al., V^tis , 22:23-39 (1983)) and Namba et 
al., Ann . Phvi-npathol . .^nr . Japan , 45:70-73 (1979)), and 
closterovirus-like (Namba, fln^ ■ Phytopathol . Soc, Japan/ 
45:497-502 (1979)) particles. In recent years, however, long 
5 flexuous closteroviruses ranging from 1,4 00 to 2,200 nra in 
length have been most consistently associated with leafroll 
disease (Castellano (1983), Faoro et al . , Riv, Patol ■ Veg. , 
Ser IV . 17:183-189 (1981); Gugerli et al., Rev, Suisse 
Vifcicult, Arboricu ll- - Hort . . 16:299-304 (1984); Hu et al., jL. 
10 Phytopathol . . 128:1-14 (1990); Milne et al . , PhytOPathol ,,Z.i 
110:360-368 (1984); Zee et al . , Phytopathology, 77:1427-1434 
(1987); Zimmermann et al., J , Phytopathol , , 130:205-218 

(1990) . These closteroviruses are referred to as grapevine 
leafroll associated viruses (GLRaV) . At least six 

15 serologically distinct types of GLRaV 1 s (GLRaV- 1 to -6) have 
been detected from leafroll diseased vines (Table 1) (Boscia 
et al., Vnf.is - 34:171-175 (1995); (Martelli, "Leafroll," pp. 
37-44 in Martelli, ed., ^ft Transmissible Diseases of 
Grapevines. Handbook for ftpt-pr-tion and Diagnosis, FAO, Rome 

20 Italy, (1993)). The first five of these were confirmed in the 
10th Meeting of the International Council for the Study of 
Virus and Virus Diseases of the Grapevine (ICVG) (Volos, 
Greece, 1990) . Through the use of monoclonal antibodies, 
however, the original GLRaV II described in Gugerli (1984) has 

25 been shown to be an apparent mixture of at least two 

components, Ila and lib (Gugerli et al., "Grapevine Leafroll 
Associated Virus II Analyzed by Monoclonal Antibodies," 11th 
Meeting of rhe internationa l Ponncj 1 for the Study of Viruses 
and virus n-ispassg of the Grapevine. Montreux, Switzerland, 

30 pp. 23-24 (1993)). Recent investigation with comparative 

serological assays (Boscia (1995)) demonstrated that the lib 
component of cv. Chasselas 8/22 is the same as the GLRaV- 2 
isolate from France (Zimmermann (1990)) which also include the 
isolates of grapevine corky bark associated closteroviruses 

35 from Italy (GCBaV-BA) (Boscia (1995)) and from the United 

States (GCBaV-NY) (Namba et al., Phytopathology, 81:964-970 

(1991) ). The Ila component of cv. Chasselas 6/22 was given 
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the provisional name of grapevine leafroll associated virus 6 
(GLRaV-6) . Furthermore, the antiserum to the CA-5 isolate of 
GLRaV-2 produced by Boscia et ai. (Boscia et al., 
Phytopatholocry , 80:117 (1990)) was shown to contain antibodies 
5 to both GLRaV-2 and GLRaV-1, with a prevalence of the latter 

{Boscia (1995)). 

Several shorter closteroviruses (particle length 800 nm 
long) have also been isolated from grapevines. One of these, 
called grapevine virus A (GVA) has also been found associated, 

10 though inconsistently, with the leafroll disease (Agran et 
al., Vitis . 29:43-48 (1990); Conti, et al., Phytopat.hol . 
Mediterr. . 24:110-113 (1985); Conti et al., Phytopathology/ 
70:394-399 (1980)). The etiology of GVA is not really known; 
however, it appears to be more consistently associated with 

15 rugose wood sensu lato (Rosciglione at al., Rev. Suisse Vitic 
Arbor ic . Kortic. . 18:207-211 (1986); and Zimmermann (1990)). 
Another short closterovirus (800 nm long) named grapevine 
virus B (GVB) has been isolated and characterized from corky 
bark-affected vines (Boscia et al., Arch. Virol. r 130:109-120 

20 (1993); Namba (1991)). 

As suggested by Martelli, leafroll symptoms may be 
induced by more than one virus or they may be simply a general 
plant physiological response to invasion by an array of 
phloem-inhabiting viruses. Grapevine leafroll is induced by 

25 one (or a complex) of long closteroviruses (particle length 
1,400 to 2,200 nm) . 

Grapevine leafroll is transmitted primarily by 
contaminated scions and rootstocks. However, under field 
conditions, several species of mealybugs have been shown 

30 vectors of leafroll (Engelbrecht et al., Phvtophylacti ca . 
22:341-346 (1990); Engelbrecht et al., Phytophylactica r 
22:347-354 (1990); Rosciglione, et al . , (Abstract), 
Phytoparasi tica , 17:63-63 (1989); and Tanne, Phytoparasitica, 
16:288 (1988)). Natural spread of leafroll by insect vectors 

35 is rapid in various parts of the world. In New Zealand, 
observations of three vineyards showed that the number of 
infected vines nearly doubled in a single year (Jordan et al., 
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in-v, M^t-irr r.f tho T m -T^""* 1 council for the study of 
v^iq„« a ^ yj^* rnsga^s of the Gr a pev i ne , Montreux, 
Switzerland, pp. 113-114 (1993)). One vineyard became 90% 
infected 5 years after GLRaV-3 was first observed. Prevalence 
of leafroll worldwide may increase as chemical control of 
mealybugs becomes more difficult due to the unavailability of 
effective insecticides. 

In view of the serious risk grapevine leafroll virus 
poses to vineyards and the absence of an effective treatment, 
there is a need to prevent this disease and the resulting 
economic losses. The present invention overcomes this 
deficiency in the art. 

SUMMARY OF INVENTION 



15 



20 



25 



The present invention relates to an isolated protein or 
polypeptide corresponding to a protein or polypeptide of a 
grapevine leafroll virus. The encoding RNA and DNA molecules, 
in either isolated form or incorporated in an expression 
system, vectors, host cells, and transgenic Vitis or citrus 
scions or rootstock cultivars, are also disclosed. 

Another aspect of the present invention relates to a 
method of imparting grapevine leafroll virus resistance to 
Vitis scion or rootstock cultivars by transforming them with a 
DNA molecule encoding a protein or polypeptide of a grapevine 
leafroll virus. These DNA molecules can also be used in 
transformation of citrus scion or rootstock cultivar to impart 
tristeza virus resistance to such cultivars. 

The present invention also relates to an antibody or 
3 0 binding portion thereof or probe which recognizes the protein 
or polypeptide or the nucleic acid encoding same. 

Grapevine leafroll virus resistant transgenic variants of 
the current commercial grape cultivars and rootstocks allows 
for improved control of the virus while retaining the varietal 
35 characteristics of specific cultivars. Furthermore, these 

variants permit control of GLRaV transmitted by contaminated 
scions or rootstocks or by GLRaV- carrying mealybugs or other 
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insect pests. With respect to the latter mode of 
transmission, the present invention circumvents increasingly 
restricted pesticide use, which has made chemical concrol of 
mealybug infestations increasingly difficult. Thus, the 
interests of the environment and the economics of grape 
cultivation and wine making are benefited by the present 
invention. 



BRIEF DESCRIPTION OF TEE DRAWINGS 
10 Figures 1A-1B, illustrates the results of Northern blot 

hybridization. Figure IB shows that probe made from a clone 

insert gave positive reaction with itself (lane 3) and to 

dsRNA from leafroll infected tissues (lane 1) , but not with 

nucleic acids extracted from healthy grapevines (lane 2). 
15 Lane M contains molecular weight markers (EindHI digested 

lambda DNA) . Figure 1A depicts an ethidium bromide stained 

agarose gel before transfer to a membrane. 

Figure 2 presents an analysis of GLRaV-3 dsRNA by 

electrophoresis on an ethidium bromide stained agarose gel. A 
20 dsRNA of ca. 16 kb was readily isolated from diseased 

grapevine (lane 6) , but not from the healthy control (lane 5) . 

Other samples that were used for control were tobacco mosaic 

virus dsRNA (lane 1) ; cucumber mosaic virus dsRNA (lane 2) ; 

pBluescript vector (lane 3) and an insert of clone pC4 . X 
25 Hindi 1 1 digested lambda DNA was used as molecular weight 

markers (lane M) . 

Figure 3 is a Western blot of antibodies to GLRaV-3 that 

reacted to proteins produced by cDNA clones after IPTG 

induction in E. coli. Similar banding patterns were observed 
3 0 whether a polyclonal (panel A) or a monoclonal antibody 

(panel B) was used. Lane 1 shows clone pCP10-l; lane 2, pCP5; 

lane 3, pCP8-4; and lane 4, the native coat protein from 

GLRaV-3 infected tissue. Lane M is a prestained protein 

molecular weight marker. 
3 5 Figure 6 shows the cDNA clones containing the coding 

region for the coat protein of the NY1 isolate of GLRaV-3 . 

Three clones (pCP8-4, pCP5, pC?lC-l) were identified by 
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immunoscreening a cDNA library prepared in lambda ZAP II. Two 
other clones were aligned after plaque hybridization and 
nucleotide sequencing. The coat protein ORF is shown by an 
arrow in an open rectangle. 
5 Figure 5 is the phylogenetic tree generated using results 

obtained using the Clustal Method of MegAlign program in 
DNASTAR for the coat protein of GLRaV-3 . The coat protein of 
GLRaV-3 was incorporated into a previously described alignment 
(Dolja et al., Ann. Rev. Phytopathol . , 32:261-285 (1994)) for 

10 comparison. The other virus sequences were obtained from 

current databases: apple chlorotic leaf spot virus (ACLSV) ; 
apple stem grooving virus (ASGV) ; apple stem pitting virus 
(ASPV) ; barley yellow mosaic virus (BaMV) ; beet yellows 
closterovirus (BYV) ; diverged copies of BYV and CTV coat 

15 proteins (BYV p24 and CTV p27, respectively) ; citrus tristeza 
virus (CTV) ; grapevine virus A (GVA) ; grapevine virus B (GVB) ; 
lily symptomless virus (LSV) ; lily virus X (LVX) ; narcissus 
mosaic virus (NMV) ; pepper mottle virus (PeMV) ; papaya mosaic 
virus (PMV) ; potato virus T (PVT) ; potato virus S (PVS) ; 

20 potato virus M (PVM) ; potato virus X (PVX) ; tobacco etch 
virus (TEV) ; tobacco vein mottle virus (TVMV) ; and white 
clover mosaic virus (WcMV) • 

Figure S depicts an analysis of reverse transcription 
polymerase chain reaction (RT-PCR) to detect GLRaV-3 in a 

25 partially purified virus preparation. The original sample 

concentration is equivalent to 50 mg//il of phloem tissue (lane 
1) which was diluted by 10-fold series as 10' 1 (lane 2), 10 -2 
(lane 3), 10" 3 (lane 4), 10" 4 (lane 5), and 10" s (lane 6), 
respectively. The expected 219 bp PCR product was clearly 

30 observed up to lane 4, which is equivalent to a detection 
limit of 10 jig of phloem tissue. Lane 7 was a healthy 
control. Lane 8 was dsRNA for positive control. Lanes 9-11 
were also used for positive controls of purified viral RNA 
(lane 9), dsRNA (lane 10), and plasmid DNA (pC4) (lane 11) as 

3 5 templates, respectively. Lane M contains molecular weight 
markers (Haelll digested fX 174 DNA) . 
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Figures 7A-7B depicts comparative analysis of Nested PCR 
with immuno- capture preparations on field collected samples. 
Using a polyclonal antibcdy to GLRaV-3 for immune- capture, the 
expected 64 8 bp PCR product was not consistently observed in 
5 the first round of PCR amplification with external primers 

over a range of samples (lanes 1-7, Figure 7A) . However, the 
expected 219 bp PCR product amplified by internal primers was 
consistently observed over all seven samples (lanes 1-7, 
Figure 7B) . A similar inconsistency is also shown in a sample 

10 prepared by proteinase K- treated crude extract (compare panels 
A to B on lane 8) . With dsRNA as template, the expected PCR 
products were readily observed in both reactions (compare lane 
10 in Figure 7A and 7B) . No such products were observed on a 
healthy sample (lane 9) . Lane M contained molecular weight 

15 markers (Haelll digested fx 174 DNA) . 

Figures 8A-8B depict comparative studies on the 
sensitivity of Nested PCR with samples prepared using 
proteinase K-treated crude extract (Figure 8A, PK Nested PCR) 
and by immuno -capture preparation (Figure 8B, IC Nested PCR) . 

20 Nested PCR was performed on samples with serial 10- fold 

dilutions of up to 10" 6 in a proteinase K-treated and 10" fl in an 
immuno -capture preparation. The expected 219 bp PCR product 
was observed up to 10' 5 in PK Nested PCR and over 10 _e (the 
highest dilution used in this test) in IC Nested PCR. A 

25 similar PCR product was also observed with dsRNA template but 
not healthy grape tissues (H. CK) . Lane M contained molecular 
weight markers (Haelll digested fx 174 DNA) . 

Figure 9 shows partial genome organization for GLRaV-3 
and the cDNA clones used to determine nucleotide sequence. 

3 0 Numbered lines represent nucleotide coordinates in kilobases 
(kb) . 

Figure 10 depicts the proposed genome organization of the 
GLRaV-3 in comparison with three other closterovirus genomes, 
BYV, CTV, and LIYV (Dolja (1994)). Homologous proteins are 
35 shown by identical patterns. Papain-like proteinase (P-PRO) ; 
methyltransf erase of type 1 (MTR1) ; RNA helicase of 
superfamily 1 (HELD; RNA polymerase of supergroup 3 (PL03); 
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HSP70 -related protein (HSP70r) ; and capsid protein forming 
filamentous virus particle (CPf ) . 

Figure 11 is the phylogenetic tree showing the amino acid 
sequence relationship of the helicase of alphaviruses . The 
5 helicase domain of GLRaV-3 (291 aa) from the present study is 
used. The other virus sequences were obtained from current 
databases (Swiss-Prot and GenBank, release 84.0). Apple 
chlorotic leaf spot virus (ACLSV) ; broad bean mottle virus 
(BbMV) ; brome mosaic virus (BMV) ; beet yellow closterovirus 
10 (BYV) ; cowpea chlorotic mottle virus (CcMV) ; cucumber mosaic 
virus (CMV) ; fox mosaic virus (FxMV) ; lily symptomless virus 
(LSV) ; lily virus X (LXV) ; narcissus mosaic virus (NMV) ; pea 
early browning virus (PeBV) ; papaya mosaic virus (PMV) ; poplar 
mosaic virus (PopMV) ; peanut stunt virus (PSV) ; potato virus S 
15 (PVS) ; potato virus M (PVM) ; potato virus X (PVX) ; strawberry 
mild yellow edge-associated virus (Sm Yea V) ; tomato aspermy 
virus (TAV) ; tobacco mosaic virus (TMV) ; tobacco rattle virus 
(TRV) ; and white clover mosaic virus (WcMV) . 

Figure 12 shows the phylogenetic tree for the RNA 
20 dependent RNA polymerases {RdRp) of the alpha- like supergroup 
of positive strand RNA viruses. The deduced amino acid 
sequence of the RdRp of GLRaV-3 was incorporated into a 
previously described alignment (Dolja (1994)) for comparison. 
The other virus sequences were obtained from current 
25 databases: Apple chlorotic leaf spot virus (ACLSV); alfalfa 
mosaic virus (AlMV) ; apple stem grooving virus (ASGV) ; brome 
mosaic virus (BMV) ; beet necrotic yellow vein virus (BNYW) ; 
beet yellow virus (BYV) ; barley stripe mosaic virus (BSMV) ; 
beet yellow stunt virus (BYSV) ; cucumber mosaic virus (CMV) ; 
30 citrus tristeza virus (CTV) ; hepatitis E virus (HEV) ; potato 
virus M (PVM) ; potato virus X (PVX) ; raspberry bushy dwarf 
virus (RBDV) ; shallot virus X (SHVX) ; Sinbis virus (SNBV) ; 
tobacco mosaic virus (TMV) ; tobacco rattle virus (TRV) ; and 
turnip yellow mosaic virus (TYMV) . 
35 Figure 13 is the predicted phylogenetic relationship for 

viral and cellular HSP70 proteins. HSP70-related protein of 
GLRaV-3 (p59) was incorporated into a previously described 
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alignment (Dolja (1994)) for comparison. The sequences of 
BYV, CTV, and LIYV proteins were from Agranovsky et al . , jL. 

. VM ml . . 217:603-610 (1991), Pappu et al. , Virology, 
199:35-46 (1994), and Klaassen et al., Virology, 208:99-110 
5 (1995) , respectively. Only the N-terminal half of beet yellow 
stunt virus HSP70-related protein (Karasev et al., J, Gen, 
Vi rol . . 75:1415-1422 (1994)) is used. Other sequences were 
obtained from the Swiss-Prot database; their accession numbers 
are as follows: DNA1_BACSU, Bacillus subtilis (P13343) ; 

10 DNAKJSCOLI , Escherichia coli (P04475) ; HS70_CHICK (P08106) ; 
HS70JDNCMY, Oncorhynchus mykiss (P08108) ; HS70_PLACB, 
Plasmodium cynomolgi (Q05746) ; HS70_SCHMA, Schistosoma 
mansoni (P08418) ; HS70_XENLA, Xenopus laex'is (P02827) ; 
HS71_DR0ME, Drosophila melanogaster (P02825) ; HS71_HUMAN 

15 (P08107) ; HS71_M0USE (P17879); HS71_PIG (P34930); HS74_PARLI , 

Paracentrotus lividus (Q06248) / HS74_TRYBB, Trypanosoma brucei 
(P11145) ; and ZMHSP702, maize gene for heat shock protein 70 
exon 2 (X03697) . 

Figure 14 summarizes the strategies employed in the 

2 0 construction of the plant transformation vector pBin!9GLRaV- 
3hsp90-12-3. A plant expression cassette, in the Hindlll- 
EcoRI fragment containing CaMV 35S-35S promoters-AMV 5' 
untranslated sequence-43K ORF-Nos 3' untranslated region, was 
excised from pBI525GLRaV-3hsp90 and cloned into the similarly 

25 (restriction enzyme) treated plant transformation vector 
pBin!9. The resulting clone, pBin!9GLRaV-3hsp90-12-3 , is 
shown. Locations of important genetic elements within the 
binary plasmid are indicated: BR, right border; BL, left 
border; Nos-NPT II, plant expressible neomycin 

30 phosphotransferase gene; Lac -LAC Z, plant expressible Lac Z 

gene; and Bacterial Kan, bacterial kanamycin resistance gene. 

Figure 15 shows the Agrobacterium- binary vector 
pGA4 82G/cpGLRaV-3 , which was constructed by cloning the 
Hindi I I fragment of pEPT8cpGLRaV-3 into a derivative of pGA4 82 

35 and used for transformation via Agrobacterium or Biolistic 
approach . 
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DETAILED DESCRIPTION OF THE INVENTION 

The present invention relates to isolated DNA molecules 
encoding the proteins or polypeptides of a grapevine leafroll 
5 virus. A substantial portion of the grapevine leafroll virus 
genome, within which are a plurality of open reading frames, 
has been sequenced by the present inventors. One such DNA 
molecule contains an open reading frame encoding grapevine 
leafroll virus helicase and comprising the nucleotide sequence 
10 corresponding to SEQ ID NO:l. The helicase has an amino acid 
sequence corresponding to SEQ ID NO: 2 and a molecular weight 
from about 146 to about 151 kDa, preferably about 148.5 kDa. 

Another such DNA molecule comprises an open reading frame 
which codes for a grapevine leafroll virus RNA-dependent RNA 
15 polymerase and comprises the nucleotide sequence corresponding 
to SEQ ID NO: 3. The RNA-dependent RNA polymerase has an amino 
acid sequence as given in SEQ ID NO: 4 and a molecular weight 
from about 59 to about 63 kDa, preferably about 61 kDa. 

Another such DNA molecule comprises an open reading frame 
20 which codes for a grapevine leafroll virus hsp70 -related 

protein or polypeptide and comprises the nucleotide sequence 
corresponding to SEQ ID NO:5. The hsp70-related protein has 
an amino acid sequence corresponding to SEQ ID NO: 6 and a 
molecular weight from about 57 to about 61 kDa, preferably 
25 about 59 kDa. 

Another such DNA molecule comprises an open reading frame 
which codes for a grapevine leafroll virus hsp90-related 
protein and comprises the nucleotide sequence corresponding to 
SEQ ID NO: 7. The hsp90-related protein has an amino acid 
3 0 sequence corresponding to SEQ ID NO: 8 and a molecular weight 
from about 53 to about 57 kDa, preferably about 55 kDa. 

Another such DNA molecule comprises an open reading frame 
which codes for a grapevine leafroll virus coat protein or 
polypeptide. The DNA molecule comprises the nucleotide 
35 sequence corresponding to SEQ ID NO:9. The coat protein has 
an amino acid sequence as given in SEQ ID NO: 10 and a 
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molecular weight from about 33 to about 43 kDa, preferably 
about 35 kDa. 

Alternatively, the DNA molecule of the present invention 
can constitute an open reading frame which codes for a first 
5 undefined protein or polypeptide. This DNA molecule comprises 
the nucleotide sequence corresponding to SEQ ID NO: 11. The 
first undefined protein or polypeptide has an amino acid 
sequence corresponding to that in SEQ ID NO: 12 and a molecular 
weight from about 5 to about 7 kDa, preferably about 6 kDa. 

10 Another such DNA molecule constitutes an open reading 

frame which codes for a second undefined grapevine leafroll 
virus protein or polypeptide and comprises the nucleotide 
sequence corresponding to SEQ ID NO: 13. The second undefined 
urotein or polypeptide has an amino acid sequence as given in 

15 SEQ ID NO: 14 and a molecular weight from about 4 to about 6 
kDa, preferably about 5 kDa. 

Another such DNA molecule constitutes an open reading 
frame which codes for a grapevine leafroll virus coat protein 
repeat and comprises the nucleotide sequence corresponding to 

20 SEQ ID NO:15. The coat protein repeat has an amino acid 

seouence as given in SEQ ID NO: 16 and a molecular weight from 
about 51 to about 55 kDa, preferably about 53 kDa. 

Yet another such DNA molecule constitutes an open reading 
frame which codes for a third undefined grapevine leafroll 

25 virus protein or polypeptide and comprises the nucleotide 

sequence corresponding to SEQ ID NO: 17. The third undefined 
protein or polypeptide has an amino acid sequence as given in 
SEQ ID NO: 18 and a molecular weight from about 33 to about 3 9 
kDa, preferably about 36 kDa. 

30 Yet another DNA molecule which constitutes an open 

reading frame for a fourth undefined grapevine leafroll virus 
protein or polypeptide comprises the nucleotide seuqence 
corresponding to SEQ ID NO: 19. The fourth undefined protein 
or polypeptide has an amino acid sequence as given in SEQ ID 

3 5 NO: 20 and a molecular weight from about 17 to about 23 kDa, 
preferably about 20 kDa. 



WO 97/22700 



14 



PCT/US96/20747 



Yet another DNA molecule constitutes an open reading 
frame for a fifth undefined grapevine leafroll virus protein 
or polypeptide and comprises the nucleotide sequence 
corresponding to SEQ ID NO: 21. The fifth undefine protein or 
5 polypeptide has an amino acid sequence as given in SEQ ID 
NO: 22 and a molecular weight from about 17 to about 23 kDa, 
preferably about 20 kDa. 

Yet another DNA molecule of the present invention 
consitutes an open reading frame for a sixth undefined protein 

10 or polypeptide and comprises the nucleotide sequence 

corresponding to SEQ ID NO: 23. The sixth undefined protein or 
polypeptide has an amino acid sequence as given in SEQ ID 
NO: 24 and a molecular weight from about 5 to about 9 kDa, 
preferably about 7 kDa. 

15 Also encompassed by the present invention are fragments 

of the DNA molecules of the present invention. Suitable 
fragments capable of imparting grapevine leafroll resistance 
to grape plants are constructed by using appropriate 
restriction sites, revealed by inspection of the DNA 

20 molecule's sequence, to: (i) insert an interposon (Felley et 
al., Sena, 52:147-15 (1987), which is hereby incorporated by 
reference) such that truncated forms of the grapevine leafroll 
virus coat polypeptide or protein, that lack various amounts 
of the C- terminus, can be produced or (ii) delete various 

25 internal portions of the protein. Alternatively, the sequence 
can be used to amplify any portion of the coding region, such 
that it can be cloned into a vector supplying both 
transcription and translation start signals suitable for the 
desired host cell. 

30 Variants may also (or alternatively) be modified by, for 

example, the deletion or addition of nucleotides that have 
minimal influence on the properties, secondary structure and 
hydropathic nature of the encoded polypeptide. For example, 
the nucleotides encoding a polypeptide may be conjugated to a 

35 signal (or leader) sequence at the N-terminal end of the 
protein which co-translationally or post-translationally 
directs transfer of the protein. The nucleotide sequence may 
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also be altered so that the encoded polypeptide is conjugated 
to a linker or other sequence for ease of synthesis, 
purification, or identification of the polypeptide. 

The protein or polypeptide of the present invention is 
5 preferably produced in purified form (preferably, at least 
about 80%, more preferably 90%, pure) by conventional 
techniques. Typically, the protein or polypeptide of the 
present invention is isolated after lysing or sonication. 
After washing, the lysate pellet is resuspended in buffer 

10 containing Tris-HCl. During dialysis, a precipitate forms 

from this protein solution. The solution is centrifuged, and 
the pellet is washed and resuspended in the buffer containing 
Tris-HCl. Proteins are resolved by electrophoresis through an 
SDS 12% polyacrylamide gel. 

15 The DNA molecule encoding the grapevine leafroll virus 

protein or polypeptide of the present invention can be 
incorporated in cells using conventional recombinant DNA 
technology. Generally, this involves inserting the coding 
sequence into an expression system to which the DNA molecule 

20 is heterologous (i.e. not normally present) . The heterologous 
DNA molecule is inserted into the expression system or vector 
in proper sense orientation and correct reading frame. The 
vector contains the necessary elements for the transcription 
and translation of the inserted protein-coding sequences as 

25 well known in the art. 

U.S. Patent No. 4,237,224 (Cohen and Boyer) , hereby 
incorporated by reference, describes the production of 
expression systems in the form of recombinant plasmids using 
restriction enzyme cleavage and ligation with DNA ligase. 

30 These recombinant plasmids are then introduced, e.g., by 
transformation, and replicated in unicellular cultures 
including procaryotes and eucaryotic cells grown in culture. 

Recombinant genes may also be introduced into virus 
vectors, such as vaccinia virus. Recombinant viruses can be 

35 generated by transfection of plasmids into cells infected with 
virus . 
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Suitable vectors include, but are not limited to, the 
following viral vectors such as lambda vector system gtll, gt 
WES.tB, Charon 4, and plasmid vectors such as p3R322, pBR325, 
PACYC177, pACYC184 , pUC8, pUC9 , pUC18, pUCl9, pLG339, pR290, 
5 pKC37 # pKClOl, SV 40, pBluescript II SK +/- or KS +/- (see 

Stratagpni* Cloni ng Systems Catalog (1993) from Stratagene, La 
Joila, CA, hereby incorporated by reference), pQE, pIH821, 
pGEX, pET series (see Studier et . al . , Gene Ex pression 
Technology , vol. 185 (199 0) , hereby incorporated by 

10 reference), and any derivatives thereof. Recombinant 

molecules can be introduced into cells via transformation, 
transduction, conjugation, mobilization, or electroporation. 
The DNA sequences are cloned into the vector using standard 
cloning procedures in the art, as described by Maniatis et 

15 al., Molecular Cloning: A Laboratory Manual . Cold Springs 
Laboratory, Cold Springs Harbor, New York (1982) , which is 
hereby incorporated by reference . 

A variety of host-vector systems may be utilized to 
express the protein-encoding sequence (s). Primarily, the 

20 vector system must be compatible with the host cell used. 
Host -vector systems include but are not limited to the 
following: bacteria transformed with bacteriophage DNA, 
plasmid DNA, or cosmid DNA; microorganisms such as yeast 
containing yeast vectors; mammalian cell systems infected with 

25 virus (e.g., vaccinia virus, adenovirus, etc.); insect cell 
systems infected with virus (e.g., baculovirus) ; and plant 
cells infected by bacteria or transformed via particle 
bombardment (i.e. biolistics) . The expression elements of 
these vectors vary in their strength and specificities. 

3 0 Depending upon the host -vector system utilized, any one of a 
number of suitable and well known transcription and 
translation elements can be used. 

Different genetic signals and processing events control 
many levels of gene expression (e.g., transcription and 

35 messenger RNA (mRNA) translation) . Transcription of DNA is 

dependent upon the presence of a promotor, a DNA sequence that 
directs the binding of RNA polymerase £ind thereby promotes 
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mRNA synthesis. The DNA sequences of eucaryotic promotors 
differ from those of procaryotic promotors. Furthermore/ 
eucaryotic promotors and accompanying genetic signals, 
including enhancer- like sequences and inducible regulatory 
5 sequences, may not be recognized in or may not function in a 
procaryotic system, and, further, procaryotic promotors are 
usually not recognized and do not function in eucaryotic 
cells. 

Similarly, translation of mRNA in procaryotes depends 

10 upon the presence of the proper procaryotic signals, which 
differ from those of eucaryotes. Efficient translation of 
mRNA in procaryotes requires a ribosome binding site (Shine- 
Dalgarno (SD) sequence) on the mRNA. This sequence is a short 
nucleotide sequence that is located before the start codon, 

15 usually AUG, which encodes the N- terminal methionine of the 

protein. The SD sequences are complementary to the 3 1 -end of 
the 16S rRNA (ribosomal RNA) and promote binding of mRNA to 
ribosomes by duplexing with rRNA to allow correct positioning 
of the ribosome. For a review on maximizing gene expression, 

20 see Roberts and Lauer, Mgt-hnds in Enzvmoloery . 68:473 (1979), 
which is hereby incorporated by reference. 

Promotors vary in their "strength" (i.e. their ability to 
promote transcription) . It is generally desirable to use 
strong promotors in order to obtain a high level of 

25 transcription and, hence, expression of the cloned gene of 

interest. Depending upon the host cell system utilized, any 
one of a number of suitable promotors may be used. For 
instance, when cloning in E. coli, its bacteriophages, or 
plasmids, promotors such as the T7 phage promoter, lac 

30 promotor, trp promotor, recA promotor, ribosomal RNA promoter, 
the P R and P L promotors of coliphage lambda and others, 
including but not limited, to lacUVS, ompF, Jbla, Ipp, and the 
like, may be used to direct high levels of transcription of 
adjacent DNA segments. Additionally, a hybrid trp-lacUV5 

35 (tac) promotor or other E. coli promotors produced by 

recombinant DNA or other synthetic DNA techniques may be used 
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to provide for transcription of the inserted coding sequence 
or other inserted nucleic acid. 

Bacterial host cell strains and expression vectors may be 
chosen which inhibit the action of the promotor unless 
5 specifically induced. In certain operons, the addition of 

specific inducers is necessary for efficient transcription of 
the inserted DNA. For example, the lac operon is induced by 
the addition of lactose or IPTG (isopropylthio-beta-D- 
galactoside) . A variety of other operons, such as trp, pro, 

10 etc., have different under regulatory mechanisms. 

Specific initiation signals are also required for 
efficient gene transcription and translation in procaryotic 
cells. These transcription and translation initiation signals 
may vary in "strength" as measured by the quantity of gene 

15 specific messenger RNA and protein synthesized, respectively. 

The DNA expression vector, which contains a promotor, may also 
contain any combination of various "strong" transcription 
and/or translation initiation signals. For instance, 
efficient translation in E. coli requires a SD sequence about 

20 7-9 bases 5' to the initiation codon (ATG) to provide a 

ribosome binding site. Thus, any SD-ATG combination that can 
be utilized by host cell ribosomes may be employed. Such 
combinations include, but are not limited to, the SD-ATG 
combination from the cro gene or the N gene of coliphage 

25 lambda, or from the E. coli trp E, D, C, B or A genes. 

Additionally, any SD-ATG combination produced by recombinant 
DNA or other techniques involving incorporation of synthetic 
nucleotides may be used. 

Once the isolated DNA molecules encoding the various 

30 grapevine leaf roll virus proteins or polypeptides, as 

described above, have been cloned into an expression system, 
they are ready to be incorporated in a host cell. Such 
incorporation can be carried out by the various forms of 
transformation noted above, depending upon the vector/host 

3 5 cell system. Suitable host cells include, but are not limited 
to, bacteria, yeast, mammalian cells, insect, plant, and the 
like. 
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The present invention also relates to RNA molecules which 
encode the various grapevine leafroll virus proteins or 
polypeptides described above. The transcripts can be 
synthesized using the host cells of the present invention by 
5 any of the conventional techniques. The mRNA can be 

translated either in vitro or in vivo. Cell-free systems 
typically include wheat -germ or reticulocyte extracts. 

One aspect of the present invention involves using one or 
more of the above DNA molecules encoding the various proteins 

10 or polypeptides of* a grapevine leafroll virus to transform 

grape plants in order to impart grapevine leafroll resistance 
to the plants. The mechanism by which resistance is imparted 
is not known. As hypothesized, the transformed plant can 
express the coat protein or polypeptide, and, when the 

15 transformed plant is inoculated by a grapevine leafroll virus, 
such as GLRaV-1, GLRaV-2, GLRav-3, GLRaV-4, GLRaV-5, or 
GLRaV-6, or combinations of these, the recombinantly expressed 
coat protein or polypeptide surrounds the virus, thereby 
preventing translation of the viral DNA. 

20 In this aspect of the present invention the subject 

coding sequence incorporated in the plant can be 
constitutively expressed. Alternatively, expression can be 
regulated by a promoter which is activated by the presence of 
grapevine leafroll virus. Suitable promoters for these 

25 purposes include those from genes expressed in response to 
grapevine leafroll virus infiltration. Additional suitable 
plant promoters include those which induce downstream gene 
expression in response to wounding, in response to elicitors 
and in response to virus infection. In the alternative, a 

30 constitutive plant -expressible promoter can be used; it is 
preferred that the level of gene expression is sufficiently 
high to provide virus resistance but not so high as to be 
detrimental to the normal functioning of the cell and tissues 
in which it is expressed. Immediately upstream of the start 

35 of a coding sequence for a GLRaV-3 protein or polypeptide in 

an expression system (expression vector, for use in plants) it 
is desired that there be a Kozak consensus for translation 
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intiation (AAXXATGG, where X is any of the four nucleotides) . 
Downstream of the end of the coding sequence for the virus 
protein or polypeptide, it is preferred that there be a 
polyadenylation signal functional in plants, such as that from 
5 the nopaline synthase gene, the octopine synthase gene or from 
the CaMV 35S gene. These sequences are well known in the 
plant biotechnology art. 

The DNA coding sequences and/or molecules of the present 
invention can be utilized to impart grapevine leafroll 
10 resistance for a wide variety of grapevine plants. The DNA 

molecules are particularly well suited to imparting resistance 
to Vitis scion or rootstock cultivars. Scion cultivars which 
can be protected include those commonly referred to as Table 
or Raisin Grapes, such as Alden, Almeria, Anab-E-Shahi , Autumn 
15 Black, Beauty Seedless, Black Corinth, Black Damascus, Black 
Malvoisie, Black Prince, Blackrose, Bronx Seedless, Burgrave, 
Calmeria, Campbell Early, Canner, Cardinal, Catawba, 
Christmas, Concord, Dattier, Delight, Diamond, Dizmar, 
Duchess, Early Muscat, Emerald Seedless, Emperor, Exotic, 
Ferdinand de Lesseps, Fiesta, Flame seedless, Flame Tokay, 
Gasconade, Gold, Himrod, Hunisa, Hussiene, Isabella, Italia, 
July Muscat, Khandahar, Katta, Kourgane, Kishmishi, Loose 
Perlette, Malaga, Monukka, Muscat of Alexandria, Muscat Flame, 
Muscat Hamburg, New York Muscat, Niabell, Niagara, Olivette 
blanche, Ontario, Pierce, Queen, Red Malaga, Ribier, Rish 
Baba, Romulus, Ruby Seedless, Schuyler, Seneca, Suavis (IP 
365), Thompson seedless,, and Thomuscat. They also include 
those used in wine production, such as Aleatico, Alicante 
Bouschet, Aligote, Alvarelhao, Aramon, Baco blanc (22A) , 
Burger , Cabernet franc , Cabernet , Sauvignon , Calzin , 
Carignane, Charbono, Chardonnay, Chasselas dore, Chenin blanc, 
Clairette blanche, Early Burgundy, Emerald Riesling, Feher 
Szagos, Fernao Pires, Flora, French Colombard, Fresia, 
Furmint, Gamay, Gewurztraminer , Grand noir, Gray Riesling, 
35 Green Hungarian, Green Veltliner, Grenache, Grillo, Helena, 
Inzolia, Lagrein, Lambrusco de Salamino, Malbec, Malvasia 
bianca, Mataro, Melon, Merlot, Meunier, Mission, Montua de 
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Pilas, Muscadelle du Bordelais, Muscat blanc, Muscat Ottonel, 
Muscat Saint -Vallier, Nebbiolo, Nebbiolo fino, Nebbiolo 
Lampia, Orange Muscat, Palomino, Pedro Ximenes, Petit 
Bouschet, Petite Sirah, Peverella, Pinot noir, Pinot Saint- 
George, Priraitivo di C-ioa, Red Veltliner, Refosco, Rkatsiteli, 
Royalty, Rubired, Ruby Cabernet, Saint -Emilion, Saint Macaire,' 
Salvador, Sangiovese, Sauvignon blanc, Sauvignon gris, 
Sauvignon vert, Scarlet, Seibel 5279, Seibel 9110, Seibel 
13053, Semillon, Servant, Shiraz, Souzao, Sultana Crimson, 
Sylvaner, Tannat, Teroldico, -Tinta Madeira, Tinto cao, 
Touriga, Traminer, Trebbiano Toscano, Trousseau, Valdepenas, 
Viognier, Walschriesling, White Riesling, and Zinfandel" 
Rootstock cultivars which can be protected include Couderc 
1202, Couderc 1613, Couderc 1616, Couderc 3309, Dog Ridge, 
Foex 33 EM, Freedom, Ganzin 1 (A x R #1) , Harmony, Kober 5BB, 
LN33, Millardet & de Grasset 41B, Millardet & de Grasset 420A, 
Millardet & de Grasset 101-14, Oppenheim 4 <S04) , Paulsen 775, 
Paulsen 1045, Paulsen 1103, Richter 99,_Richter 110, Riparia 
Gloire, Ruggeri 225, Saint -George, Salt Creek, Teleki 5A, 
Vitis rupestris Constantia, Vitis California, and Vitis 
girdiaixa. 

There is extensive similarity in the hsp70-related 
sequence regions of GLRaV-3 and other closteroviruses, such as 
tristeza virus. Consequently, the GLRaV-3 hsp70-related gene 
can also be used to produce transgenic cultivars other than 
grape, such as. citrus, which are resistant to closteroviruses 
other than grapevine leaf roll, including tristeza virus. 
These include cultivars of lemon, lime, orange, grapefruit, 
pineapple, tangerine, and the like, such as Joppa, Maltaise 
Ovale, Parson (Parson Brown) , Pera, Pineapple, Queen, 
Shamouti, Valencia, Tenerife, Imperial Doblefina, Washington 
Sanguine, Moro, Sanguinello Moscato, Spanish Sanguinelli, 
Tarocco, Atwood, Australian, Bahia, Baiana, Cram, Dalmau, 
Eddy, Fisher, Frost Washington, Gillette, LengNavelina, 
Washington, Satsuma Mandarin, Dancy, Robinson, Ponkan, Duncan, 
Marsh, Pink Marsh, Ruby Red, Red Seedless, Smooth Seville, 
Orlando Tangeic, Eureka, Lisbon, Meyer Lemon', Rough Lemon, 
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Sour Orange, Persian Lime, West Indian Lime, Bearss, Sweet 
Lime, Troyer Citrange, and Citrus trifoliata. 

Plant tissue suitable for transformation include leaf 
tissue, root tissue, meristems, zygotic and somatic embryos, 
5 and anthers. It is particularly preferred to utilize embryos 
obtained from anther cultures. 

The expression systems of the present invention can be 
used to transform virtually any plant tissue under suitable 
conditions. Tissue transformed in accordance with the present 

10 invention can be grown in vitro in a suitable medium to impart 
grapevine leaf roll virus resistance. Transformed cells can be 
regenerated into whole plants such that the protein or 
polypeptide imparts resistance to grapevine leafroll virus in 
the intact transgenic plants. In either case, the plant cells 

15 transformed with the recombinant DNA expression system of the 
present invention are grown and express one of the above- 
described grapevine leafroll virus proteins or polypeptides 
and, thus, grapevine leafroll resistance. 

One technique of transforming plants with the DNA 

2 0 molecules of the present invention is by contacting the tissue 
of such plants with an inoculum of a bacterium transformed 
with a vector comprising a gene of the present invention which 
imparts grapevine leafroll resistance. Generally, this 
procedure involves inoculating the plant tissue with a 

25 suspension of bacteria and incubating the tissue for 48 to 
72 hours on regeneration medium without antibiotics at 25- 
28 °C. Cells of the genus Agrobacterium can be used to 
transform plant cells and/or plant tissue. Suitable species 
include Agrobacterium tumefaciens and Agrobacterium 

30 rhizogenes . A. tumefaciens (e.g., strains C5B , LBA4404, or 
EHA105) is particularly useful due to its well-known ability 
to transform plants, plant tissue and plant cells. 

Another approach to transforming plant cells with a gene 
which imparts resistance to pathogens is particle bombardment 

35 (also known as biolistic transformation) of the host cell. 
This can be accomplished in one of several ways. This 
technique is disclosed in U.S. Patent Nos . 4,945,050, 
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5,036,006, and 5,100,792, all to Sanford et al . , and in 
Emerschad et al., Plant Cell Reports. 14:6-12 (1995)), all 
hereby incorporated by reference. Generally, this procedure 
involves propelling inert or biologically active particles at 
5 the cells under conditions effective to penetrate the outer 

surface of the cell and to be incorporated within the interior 
thereof- When inert particles are utilized, the vector can be 
introduced into the cell by coating the particles with the 
vector containing the heterologous DNA. Alternatively, the 

10 target cell can be surrounded by the vector so that the vector 
is carried into the cell by the wake of the particle. 
Biologically active particles (e.g., dried bacterial cells 
containing the vector and heterologous DNA) can also be 
propelled into plant cells. 

15 Once grape plant tissue is transformed in accordance with 

the present invention, it is regenerated to form a transgenic 
grape plant. Generally, regeneration is accomplished by 
culturing transformed tissue on medium containing the 
appropriate growth regulators and nutrients to allow for the 

20 initiation of shoot meristems. Appropriate antibiotics are 
added to the regeneration medium to inhibit the growth of 
Agrrobacterium and to select for the development of transformed 
cells. Following shoot initiation, shoots are allowed to 
develop in tissue culture and are screened for marker gene 

25 activity. 

The DNA molecules of the present invention can be 
transcribed into mRNA, which, although encoding a grapevine 
leaf roll virus protein or polypeptide, is not translated to 
the corresponding protein. This is known as RUA-mediated 

30 resistance. When a Vitis scion or rootstock cultivar is 

transformed with such a DNA molecule, the DNA molecule can be 
transcribed under conditions effective to maintain the mRNA in 
the plant cell at low level density readings. Density 
readings of between 15 and 50 using a Hewlet ScanJet and Image 

35 Analysis Program are preferred. 
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The grapevine leafroll virus protein or polypeptide can 
also be used to raise antibodies or binding portions thereof 
or probes. The antibodies can be monoclonal or polyclonal. 
Monoclonal antibody production may be effected by 
5 techniques which are well-known in the art. Basically, the 
process involves first obtaining immune cells (lymphocytes) 
from the spleen of a mammal (e.g., mouse) which has been 
previously immunized with the antigen of interest either in 
vivo or in vitro. The antibody- secreting lymphocytes are then 

10 fused with (mouse) myeloma cells or transformed cells, which 
are capable of replicating indefinitely in cell culture, 
thereby producing an immortal, immunoglobulin- secreting cell 
line. The resulting fused cells, or hybridomas, are cultured, 
and the resulting colonies screened for the production of the 

15 desired monoclonal antibodies . Colonies producing such 

antibodies are cloned, and grown either in vivo or in vitro to 
produce large quantities of antibody. A description of the 
theoretical basis and practical methodology of fusing such 
cells is set forth in Kohler and Milstein, Nature r 256:495 

20 (1975), incorporated by reference. 

Mammalian lymphocytes are immunized by in vivo 
immunization of the animal (e.g., a mouse) with the protein or 
polypeptide of the present invention. Such immunizations are 
repeated as necessary at intervals of up to several weeks to 

25 obtain a sufficient titer of antibodies. Following the last 
antigen boost, the animals are sacrificed and spleen cells 
removed . 

Fusion with mammalian myeloma cells or other fusion 
partners capable of replicating indefinitely in cell culture 

30 is effected by standard and well-known techniques, for 

example, by using polyethylene glycol (PEG) or other fusing 
agents. (See Milstein and Kohler, Eur. J. Tmrnunnl . r 6:511 
(1976), incorporated by reference.) This immortal cell line, 
preferably murine, but may also be derived from cells of other 

3 5 mammalian species, including but not limited to rats and 

humans, is selected to be deficient in enzymes necessary for 
the utilization of certain nutrients, capable of rapid growth, 
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and having good fusion capability. Many such cell lines are 
known to those skilled in the art, and others are regularly 
described. 

Procedures for raising polyclonal antibodies are also 
5 well known. Typically, such antibodies can be raised by 
administering the protein or polypeptide of the present 
invention subcutaneously to New Zealand white rabbits which 
have first been bled to obtain pre -immune serum. The antigens 
can be injected at a total volume of 100 fil per site at six 

10 different sites. Each injected material will contain 

synthetic surfactant adjuvant pluronic polyols, or pulverized 
acrylamide gel containing the protein or polypeptide after 
SDS-polyacrylamide gel electrophoresis. The rabbits are then 
bled two weeks after the first injection and periodically 

15 boosted with the same antigen three times every six weeks. A 
sample of serum is then collected 10 days after each boost. 
Polyclonal antibodies are then recovered from the serum by 
affinity chromatography using the corresponding antigen to 
capture the antibody. Ultimately, the rabbits are euthanized 

20 with pentobarbital 150 mg/Kg IV. This and other procedures 

for raising polyclonal antibodies are disclosed in Harlow et . 

al., editors, Antibodies; A Laboratory Manual (1968), which 

is hereby incorporated by reference. 

In addition to utilizing whole antibodies, binding 

25 portions of such antibodies can be used. Such binding 

portions include Fab fragments, F(ab , ) 2 fragments, and Fv 
fragments. These antibody fragments can be made by 
conventional procedures, such as proteolytic^ fragmentation 
procedures, as described in Goding, Monoclonal Antibodies: 

30 Principle and Prartir^ r New York : Academic Press, pp. 98-118 
(19B3), hereby incorporated by reference. 

The present invention also relates to probes found either 
in nature or prepared synthetically by recombinant DNA 
procedures or other biological procedures. Nucleic acid 

35 probes can also be synthesized by manual chemical synthesis 
(see, e.g., Beaucage and Caruthers (1981) T^t^a . t.«i- t . 
22:1859-1862; MAtteuci et al . (1981) J. Am. rh<*™ | g or < 
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103:3185) or by automated chemical synthesis using 
commercially available equipment (e.g., Applied Biosystems, 
Foster City, CA) . Suitable probes are molecules which bind to 
grapevine leafroll viral antigens identified by the monoclonal 
5 antibodies of the present invention. Such probes can be, for 
example, proteins, peptides, lectins, or nucleic acid probes. 

The antibodies or binding portions thereof or probes can 
be administered to grapevine leafroll virus infected scion 
cultivars or rootstock cultivars. Alternatively, at least the 

10 binding portions of these antibodies can be sequenced, and the 
encoding DNA synthesized. The encoding DNA molecule can be 
used to transform plants together with a promoter which causes 
expression of the encoded antibody or binding portion thereof 
when the plant is infected by grapevine leafroll virus. In 

15 either case, the antibody or binding portion thereof or probe 
will bind to the virus and help prevent the usual leafroll 
response . 

Antibodies raised against the proteins or polypeptides of 
the present invention or binding portions of these antibodies 

20 can be utilized in a method for detection of grapevine 

leafroll virus in a sample of tissue, such as tissue from a 
grape scion or rootstock. Antibodies or binding portions 
thereof suitable for use in the detection method include those 
raised against a helicase, an RNA-dependent RNA polymerase, an 

25 hsp70-related, an hsp90- related, or a coat protein or 

polypeptide in accordance with the present invention. Any 
reaction of the sample with the antibody is detected using a 
reporter or other assay system which indicates the presence of 
grapevine leafroll virus in the sample. A variety of assay 

30 systems can be employed, such as enzyme - linked immunosorbent 
assays, radioimmunoassays, gel diffusion precipitin reaction 
assays, immunodiffusion assays, agglutination assays, 
fluorescent immunoassays, protein A immunoassays, or 
Immunoelectrophoresis assays. 

35 Alternatively, grapevine leafroll virus can be detected 

in such a sample using a nucleotide sequence of the DNA 
molecule, or a fragment thereof, encoding for a protein or 
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polypeptide (or a portion thereof) of the present invention. 
The nucleotide sequence is provided as a probe in a nucleic 
acid hybridization assay or a specific gene amplification 
detection procedure (e.g., using a polymerase chain reaction 
5 procedure) . Any reaction with the probe is detected so that 
the presence of grapevine leafroll virus in the sample is 
indicated. 

The following examples are provided to illustrate 
embodiments of the present invention but are by no means 
10 intended to limit its scope. References cited in the Examples 
are incorporated by reference herein. 

EXAMPLES 

Example i - Mate rials and Methods 

15 

Virus purification and dsRNA isolation. The NY1 isolate, 
also referred to as isolate GLRaV 109 by Golino, Amer. J. 
Enol . Vifcic . 43:200-205 (1992), a member of GLRaV- 3 (Hu et 
al., J. Phytona f-hcvL . (Bgrl . ) . 128:1-14 (1990)); Zee et al., 

20 Phy t opa t ho 1 nay . 77:1427-1434 (1987)) was used throughout this 
work. Leaf roll-diseased canes and mature leaves were 
collected from a vineyard in central New York State, and kept 
at -20°C until used. GLRaV- 3 virus particles were purified 
according to the method described by Zee (1987), and modified 

25 later by Hu (1990) . After two cycles of Cs 2 S0 4 gradient 
purification, virus particles were observed from virus- 
enriched fractions by negative staining on an electron 
microscope. 

The dsRNA was extracted from scraped bark/phloem tissue 
30 of canes as described in Hu (1990) . Briefly, total nucleic 
acid was extracted with phenol/chloroform; dsRNA was absorbed 
on a CF-11 cellulose column under 17% ethanol and eluted 
without ethanol. After two cycles of ethanol precipitation, 
dsRNA was analyzed by electrophoresis on a 6% polyacrylamide 
35 or 1% agarose gel. A high Mr dsRNA (-16 kb) along with 
several smaller Mr dsRNAs was consistently identified in 
leafroll diseased but not in healthy samples (Hu (1990) . The 
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16 kb dsRNA, which was presumably a replicative form cf the 
virus, was purified further following separation on a low 
melting temperature -agarose gel (Sambrook et al., MqIq^I ar 
Cloning. A T.ahoratorv Manual. 2nd T?d . . Cold Spring Harbor 
5 Laboratory Press (1989) . The double-stranded nature of the 
dsRNA was confirmed by its resistance to DNase and RNase in 
high salt and sensitivity to RNase in water (Hu (1990) . 

cDNA synthesis and molecular cloning. Complementary DNA 
(cDNA) was prepared by the procedure of Gubler et al., Sena, 

10 25:263 (1983), and modified for dsRNA by Jelkmann et al., 
Phytonai-hfilnpy. 79:1250-1253 (1989)., Briefly, following 
denaturation of about 2 \ig of dsRNA in 20 mM methylmercuric 
hydroxide (MeHg) for 10 min, the first-strand cDNA was 
synthesized by avian myeloblastosis virus (AMV) -reverse 

15 transcriptase using random primers (Boehringer Mannheim, 

Indianapolis, IN) . The second-strand cDNA was synthesized 
with DNA polymerase I while RNA templates were treated with 
RNase H. The cDNA was size -fractionated on a CL-4B Sepharose 
column and peak fractions, which contained larger molecular 

20 weight cDNA, were pooled and used for cloning. Complementary 
DNA ends were blunted with T4 DNA polymerase, and EcoRI 
adapters were ligated onto a portion of the blunt -ended cDNA. 
After treatment with T4 polynucleotide kinase and removal of 
unligated adapters by spin column chromatography, the cDNA was 

25 ligated with lambda ZAPII/EcoRI prepared arms (Stratagene, La 
Jolla, CA) . These recombinant DNAs were packaged in vitro 
with GIGAPACK II GOLD™ packaging extract according to the 
manufacturer's instruction (Stratagene). The packaged phage 
particles were used to infect bacteria, E. coli XLl-blue 

30 cells. 

Screening the cDNA library. To select GLRaV-3 dsRNA 
specific cDNA clones, probes were prepared from UNI -AMP™ 
(Clontech, Palo Alto, CA) PCR-amplif ied cDNA. PCR-amplif ied 
GLRaV-3 cDNA was labeled with 32 P [a-dATP] by Klenow fragment 
3 5 of E. coli DNA polymerase I with random primers and used as a 
probe for screening the library (Feinberg et al., Analytic 
nrhem. . 132:6-13 (1983)). Library screening was carried out 
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by transferring plaques grown overnight onto GENES CREEN PLUS™ 
filters, following the manufacturer's instructions for 
denaturation, prehybridization, and hybridization (Dupont, 
Boston, MA) . After washing, an autoradiograph was developed 
5 after exposing Kodak X-OMAT film to the washed filters 

overnight at -80°C. Bacteriophage recombinants were converted 
into plasmids {In vivo excision) following the manufacturer's 
instruction. (Stratagene) . 

Identification of the coat protein gene was done by 

10 immunoscreening the cDNA library with GLRaV-3 specific 
polyclonal (Zee (1987)) and monoclonal (Hu (1990)), 
antibodies . Degenerate primer ( 5 ' GGNGGNGGNACNTTYGAYGTNTCN 
(SEQ. ID. No. 19), I=inosine, Y=T or C) generated from a 
conserved amino acid sequence in Motif C of the BYV HSP70 gene 

15 (p65) was used to select HSP70 positive clones. Further 
sequence extension was made possible by the clone walking 
strategy, which used sequences that flanked the sequence to 
probe the library for a clone that contained an insert 
extending farther in either 5' or 3 1 direction. 

20 Northern blot hybridization. Inserts from selected 

clones were labeled with 32 P[a-dATP] by Klenow fragment of E. 
coli DNA polymerase I (Feinberg (1983) ) , and used as probes to 
test their specific reactions to dsRNAs isolated from leafroll 
infected tissues. Double -stranded RNA isolated from GLRaV-3 

25 infected vines was separated by electrophoresis on a 1% 

agarose gel (nondenatured condition) , denatured with 50 mM 
NaOH, O.fi M NaCl for 3 0 min at room temperature, and 
neutralized with 1.5 M NaCl, 0.5 M Tris-HCl, pH 7.5 for 
another 3 0 min. Denatured dsRNA was sandwich-blotted onto a 

30 GENES CREEN PLUS™ membrane. Prehybridization and hybridization 
were carried out in a manner similar to that described above. 
The membrane was washed and exposed to Kodak X-OMAT film, and 
an autoradiograph was developed. 

Identification of inrmunopoeitive clones. For 

35 immunoscreening, plates with plaques appearing after 8-12 h 
incubation at 37°C were overlaid with a 10 mM isopropyl- p-D- 
thio-galactopyranoside ( IPTG) 
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impregnated Nylon filters (GENE SCREEN PLUS™) and incubated for 
an additional 3-4 h. After blocking with 3% bovine serum 
albumin (BSA) , the blotted filter was incubated in a 1:1000 
dilution of alkaline phosphatase-conjugated GLRaV-3 polyclonal 
5 antibody for 3 h at 37°C. Positive signals (purple dots) were 
developed by incubation of washed filters in a freshly 
prepared nitroblue tetrazolium (NBT) and 5-bromo-4-chloro-3 - 
indolyl phosphate (BCIP) solution. To further confirm whether 
or not a true GLRaV-3 coat protein expression plaque was 

10 selected, a secondary immunoscreening was carried out by 
reinfection of bacterial XL1 Blue cells with an earlier 
selected plaque. 

Western blot analysis. After secondary immunoscreening, 
GLRaV-3 antibody positive plaques were converted into plasmid, 

15 the pBluescript, by in vivo excision. Single colonies were 
picked up and cultured in LB medium with 100 /xg/ml of 
ampicillin until mid- log growth. Fusion protein expression 
was induced by addition of 10 mM IPTG with an additional 3 h 
of incubation at 37°C. Bacteria was pelleted and denatured by 

20 boiling in protein denaturation buffer (Sambrook (1989)). An 
aliquot of 5 /xl denatured sample was loaded and separated by 
electrophoresis on a 12% SDS-polyacrylamide gel along with a 
prestained protein molecular weight marker (Bio-Rad, Hercules, 
CA) . The separated proteins were transferred onto an 

25 Immobulon membrane (Millipore) with an electroblotting 
apparatus (Bio-Rad) . After blocking with 3% BSA, the 
transferred membrane was incubated with 1:1,000 dilution of 
either GLRaV-3 polyclonal or monoclonal antibody/alkaline 
phosphatase conjugate. A positive signal was developed after 

30 incubation of the washed membrane in NBT and BCIP. 

PCR analysis. To analyze a cloned insert, an aliquot of 
a bacterial culture was used directly in PCR amplification 
with common vector primers (SK and KS) . PCR- amplified product 
was analyzed by electrophoresis on an agarose gel. 

35 Nucleotide sequencing and computer sequence analysis. 

Plasmid DNA, purified by either a CsCl method (Sambrook 
(1989)) or a modified mini alkaline-lysis/PEG precipitation 
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procedure {Applied Biosystems • Instruction) , was sequenced 
either with Sequenase version 2 kit following the 
manufacturer's instruction (US Biochemical, Cleveland, Ohio) 
or with Tag DYEDEOXY™ terminator cycle sequencing kit (Applied 
Biosystems, Inc.). Automated sequencing was conducted on an 
A3 13 73 automated sequencer. 

Nucleotide sequences were analyzed using a Genetics 
Computer Group (GCG) sequence analysis software package 
(Madison, WI) . Sequence fragments were assembled using 
Newgelstart to initiate the GCG fragment assembly system and 
to support automated fragment assembly in GCG Version 7.2. 

Computer-assisted analysis of phylogenetic relationships. 
Amino acid sequences were either obtained from database Swiss - 
Prot or translated from nucleotide sequences obtained from 
GenBank. A phylogenetic tree depicting a predicted 
relationship in the evolution of the GLRaV-3 coat protein 
sequence with those of other filamentous plant viruses was 
generated using the dustal Method of the DNASTAR 1 s MegAlign 
program (Madison, WI) . with the Clustal method, a preliminary 
phylogeny is derived from the distances between pairs of input 
sequences and the application of the UPGMA algorithm (Sneath 
et al " NunwtriCfP Taxonomy - ^h- ^ n n r1p . aTV , ^^< r _ nf 
Humeri ^rnnnmy , Freeman Press (1973)), which guides the 
alignment of ancestral sequences. The final phylogeny is 
produced by applying the neighborhood joining method of Saitou 
et al., Mpl , q ip l Fvol , 4:406-425 (1987), to the distance 
and alignment data. 

Nucleotide sequence and primer selection. The sequence 
rragment (Table 16) selected for PCR has now been identified 
to be from nucleotides 9364 to 10,011 of the incomolete GLRaV- 
3 genome (Table 4) . This sequence region encodes a short 
peptide which, shares sequence similarity to ESP90 homologues 
of other enteroviruses (Figure 1) . Selected primers and 
their designations are shown in Table 16, which shows the 
nucleotide and amino acid sequences of a PCR amolified 
fragment of the GLRaV-3 genome. The external and internal 



WO 97/22700 PCT/US96/20747 

32 

primers used for PCR are underlined and their orientations are 
indicated by arrows. 

Sample preparation. These include 1) dsRNA, 2) purified 
virus, 3) partially purified virus, 4) proteinase K treated 
5 crude extract, and 5) immuno-capture preparation. 

Isolation of dsRNA from leafroll infected grapevine 
tissues followed the procedure developed by Hu (1990) . 

Virus purification was effected by the following 
procedure. An aliquot of 500 fil GLRaV-3 -enriched fractions 

10 after two cycles of Cs 2 S0 4 gradient was diluted with two 
volumes of TE buffer (10 mM Tris, 1 mM EDTA, pH 8.0) and 
incubated on ice for 5 min. The reaction was then adjusted to 
a final concentration of 200 mM NaAc, pH 5,0, C.5% SDS, and 
200 /ig/ml proteinase K and incubated at 3 7°C for 3 h. Viral 

15 RNA was extracted with phenol and chloroform, ethanol- 
precipitated, and resuspended in 50 fil of diethyl 
pyrocarbonate (DEPC) -treated H 2 0. For each 100 /zl PCR reaction 
mixture, 1 ill of purified viral RNA was used as template. 

Partially purified virus was prepared according to the 

20 virus purification procedure described in Hu (1990) , but only 
to the high speed centrif ugation (27,000 rpm, 2 h) step 
without further Cs 2 S0 4 gradient centrif ugation. The pellet was 
resuspended in TE buffer and subjected to proteinase K 
treatment as described above. Viral RNA was extracted with 

25 phenol/chloroform and precipitated using ethanol. From 10 g 
of starting material, the pellet was resuspended in 200 fil of 
DEPC treated H 2 0. A 1 p.1 aliquot of extracted RNA or its 10- 
fold dilution series (up to 10" s ) was used for reverse 
transcription-PCR (RT-PCR) . 

30 Crude extract was treated with Proteinase K as follows. 

Liquid nitrogen powdered grapevine bark/phloem tissue (100 mg) 
was macerated in 1 ml of virus extraction buffer (0*5 M Tris- 
HCl, pH 9.0, 0.01 M MgS0 4 , 4% water insoluble polyvinyl 
pyrrolidone (PVP40) , 0.5% bentonite, 0.2% 2-mercaptoethanol, 

35 and 5% Triton X-100) (Zee (1987)) . After a brief 

centrifugation (5,000 rpm, 2 min), 500 /xl of supernatant was 
transferred into a new tube, adjusted to 100 fig/ml proteinase 
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K, and incubated for 1 h at 55°C (Kawasaki, "Sample 
Preparation from Blood, Cells, and Other Fluids," in Innis et 
al., eds, PCR Protocols: A Guide to Methods and Appl ications . 
Academic Press, Inc. (1990)). Following incubation, the 
preparation was boiled for 10 min to inactivate proteinase K 
and to denature the viral RNA. The upper clear phase was 
transferred into a new tube after a brief centrifugation. The 
viral RNA was precipitated with ethanol and resuspended in 100 
111 of DEPC-treated H 2 0. An aliquot of 1 jxl proteinase K- 
treated crude extract or its 10 -fold dilution series (up to 
10" 6 ) was used. 

The immuno- capture procedure was adapted from the method 
described by Wetzel at al., J. Virol. Meth. 39:27-37 (1992)). 
A 0.5 ml thin wall PCR tube was coated directly with 100 ftl of 
10 /xg/rnl purified gamma -globulin from GLRaV-3 antiserum (Zee 
(1987)) in ELISA coating buffer (15 tnM Na 2 C0 3 , 35 mM NaHCO^, pH 
9.6, and 0.02% NaN 3 ) and incubated for 4 h at 30°C. After 
washing 3 times with PBS-Tween-20 , the antibody coated tube 
was loaded with 100 \il of crude extract (1:10 or its 10-fold 
dilution series, up to 10 * a ) prepared in ELISA extraction 
buffer (50 mM sodium citrate, pH 8.3, 20 mM sodium 
diethyldithiocarbonate (DIECA) , 2% PVP 40K) and incubated at 
30°C for 4 h. After washing, a 25 fil aliquot of transfer 
buffer (10 mM Tris, pH 8.0, 1% Triton X-10 0) was added to the 
tube and vortexed thoroughly to release viral RNA. 

RT-PCR. Initially, reverse transcription (RT) and 
polymerase chain reaction (PCR) were performed in two separate 
reactions. An aliquot of 20 fil of reverse transcription 
reaction mixture was prepared to contain 2 fil of 10X PCR 
buffer (Promega, Madison, WI) (10 mM Tris-HCl, pH 8.3, 500 mM 
KC1, and 0.01% gelatin), 50 mM MgCl a , 2 /xl of 10 mM dNTP, 150 J 
ng of 5 ' and 3* primers, 16 units of RNasin, 25 units of avian 
myeloblastosis virus (AMV) reverse transcriptase, and 1 fil of 
a denatured sample preparation. The reverse transcription 
reaction was carried out at 37° C for 3 0 min. After 
denaturation by heating at 95°C for 5 min, an aliquot of PCR 
reaction mixture was added. This PCR reaction mixture (80 fil) 
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contained 8 M l of 10X PCR buffer (Protnega) , 150 mM MgCl 2 , 250 
ng of each 5- and 3 ' primer, 1 M l of 10 mM dNTP, and 2.5 units 
of Tag DNA polymerase . The thermal cycling program was set as 
follows: a precycle at 92°C for 3 min; followed by 35 cycles 
of denaturation at 92-c, 1 min; annealing at 50°C, 1 min; and 
extension at 72"C, 2.5 min. The final extension cycle was set 
at 72°C for 5 min. 

Because reverse transcriptase functions in the PCR buffer 
system, RT and PCR can be combined (RT-PCR) in a single 
reaction (Ali et al. f Bigtffchni mif 15:40-42 (1993); Goblet 
et a1 -' Nuclejc Acids ff^p,^ 17:2144 (1989)). The RT-PCR 
reaction mixture of 100 /xl contains 10 /xl of 10X PCR 
amplification buffer (Promega) , 200 mM MgCl,, 250 ng each of 
primers, 3 M l of 10 mM dNTPs, 40 units of RNasin, 25 units of 
AMV or moloney-murine leukemia virus (M-MLV) reverse 
transcriptase, 2.5 units of Taq DNA polymerase, and 1 M l of 
denatured sample preparation. The thermal cycling program was 
set as follows: one cycle of cDNA synthesis step at 37°C for 
3 0 min, immediately followed by PCR cycling as described 
above . 

Nested PCR. Inconsistent results obtained from a single 
round of PCR amplification prompted an investigation into the 
feasibility of Nested PCR. Initial PCR amplification was 
performed with an external primer set (93-110 & 92-98) (Table 
15) . A PCR product of 648 bp was consistently observed from 
dsRNA as template, but the expected PCR product was not 
consistently observed in samples prepared from proteinase K- 
treated crude extract or immuno- capture sample preparation. 
Consequently, additional PCR amplification with an internal 
primer set (93-25 &. 93-40) was carried out by adding 5 M l of 
the first external primer-amplified PCR product into a freshly 
prepared 100 itl PCR reaction mixture. The PCR cycling 
parameters were as described above. 



E xample 7 - Vims Purif i ral-inn *nH risRNA TgnlaH^ 



GLRaV-3 virus particles were purified directly from field 
collected samples of infected grapevines. Attempts to use 
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genomic RNA for cDNA cloning failed due to low yield of virus 
particles with only partial purity. However, virus particles 
were shown to be decorated by GLRaV-3 antibody using electron 
microscopy. The estimated coat protein molecular weight of 
5 41K agreed with an earlier study (Hu (1990) . Because . of low 
yield in virus purification, dsRNA isolation was further 
pursued. Based on the assumption that high Mr dsRNA (16 kb) 
is the replicative form of the GLRaV-3 genomic RNA, this high 
Mr dsRNA was separated from other smaller ones by 
10 electrophoresis (Figure 2) , purified from a low melting 
temperature agarose gel, and used for cDNA synthesis. 

Fyamplfr 3 - rDNA Sv n i-bpgis. MnTpeular Pinning, and Analysis of 

cDNA Clones. 

15 First -strand cDNA was synthesized with AMV reverse 

transcriptase using purified 16 kb dsRNA which had been 
denatured with 10 mM MeHg as template. Only random primers 
were used to prime the denatured dsRNA because several other 
closteroviruses (BYV, CTV, and LIYV) have been shown to have 

20 no polyadenylated tail on the 3' end (Agranovsky et al., jL. 
ftf>n. virol. . 72:15-24 (1991)); Agranovsky et al. ( V i rology * 
198:311-324 (1994); Karasev et al., Virology, 208:511-520 
(1995); Klaassen et al., Virology. 208:99-110 (1995); Pappu et 
al., Vi mingy . 199:35-46 (1994)). After second-strand cDNA 

25 synthesis, the cDNA was size-fractionated on a CL-4B Sepharose 
column, and peak fractions which contained larger molecular 
weight cDNA were pooled and used for cloning. An 
autoradiograph of this pooled cDNA revealed cDNA molecules up 
to 4 kb in size. 

30 A lambda ZAPII library was prepared from cDNA that was 

synthesized with random primed, reverse transcription of 
GLRaV-3 specific dsRNA. Initially, white/blue color selection 
in IPTG/X-gal containing plates was used to estimate the ratio 
of recombination. There were 15.7% white plaques, and an 

35 estimated 7 X 10 4 GLRaV-3 specific recombinants in this cDNA 
library. The library was screened with probes prepared from 
UNI-AMP™ PCR-amplified GLRaV-3 cDNA. More than 300 clones 
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with inserts of up to 3 kb were selected after screening the 
cDNA library with probe prepared from UNI -AMP™ PCR- amplified 
GLRaV-3 cDNA. In Northern blot hybridization, a probe 
prepared from a clone insert, pC4, reacted strongly to the 16 
5 kb dsRNA as well as to several other smaller Mr dsRNAs , Such 
a reaction was not observed with nucleic acids from healthy 
grape or with dsRNA of CTV (Figure 1) . 

gyamplff 4 - fiplftrtjnn and C hararteri. zat ion of ImmunOPQSitive 

io Clones 

A total of 6 X 10 4 plaques were immunoscreened with GLRaV- 
3 specific polyclonal antibody. Three cDNA clones, designated 
pCP5, pCP8-4, and pCP10-l, produced proteins that reacted to 
the polyclonal antibody to GLRaV-3 . GLRaV-3 antibody 

15 specificity of the clones was further confirmed by their 
reaction to GLRaV-3 monoclonal antibody. PCR analysis of 
cloned inserts showed that a similar size of PCR product (1.0- 
1.1 kb) was cloned in each immunopositive clone using primers 
corresponding to flanking vector seuqences (SK and KS) . 

20 However, various sizes of antibody-reacting protein were 

produced from these clones, which suggested that individual 
clones were independent and contained different segments of 
the coat protein gene. The Mr of immunopositive fusion 
protein from clone pCP10-l was estimated to be 50K in SDS- 

25 PAGE, which was greater than the native coat protein of 41K 
(compare lanes 1 to 4 in Figure 3). Immunopositive proteins 
produced in clone pCP5 (Figure 3, lane 2} and pCP8 (Figure 3, 
lane 3) were different in size and smaller than the native 
coat protein. Clone pCP5 produced a GLRaV-3 antibody-reacting 

30 protein of 29K. Clone pCP8-4, however, produced an antibody- 
reacted protein of 27K. Similar banding patterns were 
observed when either polyclonal (Figure 3 A) or monoclonal 
(Figure 3B) antibodies were used in Western blots. These 
results indicated that these cDNA clones contained coding 

3 5 sequences for the GLRaV-3 coat protein gene. 
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TTvamplA 5 - ltticleotid f fi-gnfrngi' no and Idert i f leaf *on of the 
rnat- P^ot-.Pin Gene 

Both strands of the three immunopositive clones were 
sequenced at least twice, A multiple sequence alignment of 
5 these three clones overlapped and contained an incomplete ORF 
lacking the 3 1 terminal sequence region. The complete 
sequence of this ORF was obtained by sequencing an additional 
clone, pA6-8, which was selected using the clone walking 
strategy. The complete ORF potentially encoded a protein of 

10 313 amino acids with a calculated Mr of 34 , 866 (p35) (Figure 4 
and Tables 2-3) . Table 2 shows the nucleotide and amino acid 
sequences of the coat protein gene of grapevine leafroll 
associated closterovirus-3 , isolate NY1 . Nucleotide 
sequencing was conducted by the procedure described in 

15 Example 1. The translated amino acid sequence is shown below 
the nucleotide sequence. Table 3 compares the alignment of 
the coat protein of GLRaV-3 with respect to BYV, CTV, and 
LIYV. Consensus amino acid residues are shown. Uppercase 
letters indicate identical amino acids, and lowercase letters 

20 indicate at least three identical or functionally similar 

amino acids. The three conserved amino acid residues (S, R # 
and D) identified in all filamentous plant virus coat proteins 
are in bold (Dolja et al., Vi rology, 184 :79-86 (1991)). 
Because this ORF was derived from three independent 

25 clones after screening with GLRaV-3 coat protein specific 

antibody, it was identified as the coat protein gene of GLRaV- 
3. A multiple amino acid sequence alignment of p35 with the 
coat proteins of other closteroviruses, including BYV, CTV, 
and LIYV, is presented in Table 3 . The typical consensus 

30 amino acid residues (S, R, and D) of the coat proteins of the 
filamentous plant viruses (Dolja et al., Virology , 184:79-86 
(1991) } , which may be involved in salt bridge formation and 
the proper folding of the most conserved core region (Boyko et 
al., Proe. Natl Arad. Sci . USA. 89:9156-9160 (1992)}, were 

35 also preserved in the p35. Phylogenetic analysis of the 

GLRaV-3 coat protein amino acid sequence with respect to the 
other filamentous plant viruses placed GLRaV-3 into a separate 
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but closely related branch of the closterovirus (Figure 5) . 
Direct sequence comparison of GLRaV-3 coat protein with 
respect to other closterovirus coat proteins or their diverged 
copies by the GCG Pileup program demonstrated that at the 
nucleotide level, GLRaV-3 had its highest homology to BYV 
(41.5%) and CTV (40.3%). At the amino acid level, however, 
the highest percentage similarity were to the diverged copies 
of coat protein, with 23.5% identity (46.5% similarity) to CTV 
p26 and 22. G% (44.3% similarity) to BYV p24 . 



p^mpi^ fi - TH« n Mfir»Hon of a Coat Prntftin Translation 

Tn-tHafirm Site 

Various sizes of GLRaV-3 specific antibody-reactive 
proteins were produced by three immunopositive clones in E . 
15 coli (Figure 3) . Sequences of these clones overlapped and 
represented a common ORF that was identified as the coat 
protein gene (Figure 4) . In searching for possible 
translation regulatory elements, sequence analysis beyond the 
coat protein coding region revealed a purine rich sequence, - 
20 uGAGuGAAcgcgAQQ- (SEQ ID NO: 26) , which was similar to the 

Shine -Dalgarno sequence (uppercase letters) (Shine et al., 
Pr-nrv MaM . &™H firri . USA. 71:1342-1346 (1974),), upstream 
from the coat protein initiation site (AUG) . This purine 
rich sequence can serve as an alternative ribosome entry site 
25 for the translation of the GLRaV-3 coat protein gene in E. 

coli. If this first AUG in the ORF serves for coat protein 
translation, the ribosomal entry site must be located in this 
purine rich region because an in-frame translation stop codon 
(UGA) was only nine nucleotides upstream from the coat protein 
30 gene translation initiation site (AUG) . Analysis of 

nucleotide sequence beyond the cloned insert into the vector 
sequence of clone pCP8-4 and pCP10-l provided direct evidence 
that the fusion protein was made from the N-terminal portion 
of coat protein and C- terminal portion of p-galactosidase 
35 (16. 5K). Further analysis of sequence around the selected AUG 
initiation codon of the coat protein gene revealed a consensus 
sequence 
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(-GnnADSG-) that favored the expression of eucaryotic mRNAs 
(Kozak, M^r^H^i^^l p«mH«wb. 47:1-45 (1983); Kozak, Call, 

44:283-292 (1986) ) . 

Nucleotide sequence analysis of three immunopositive 
clones revealed overlapping sequences and an ORF that covers 
about 96% of the estimated coat protein gene (Figure 4) . The 
complete ORF was obtained after sequencing of an additional 
clone ( P A6-8) that was selected by the clone walking strategy. 
Identification of this ORF as the coat protein gene was based 
upon its immunoreactivity to GLRaV-3 polyclonal and monoclonal 
antibodies, the presence of filamentous virus coat protein 
consensus amino acid residues (S, R, and D) , and the 
identification of a potential translation initiation site. 
The calculated coat protein molecular weight (35K) is smaller 
than what was estimated on SDS-PAGE (41K) . This discrepancy 
in molecular weight between computer-calculated and SDS-PAGE 
estimated falls in the expected range. 

The estimated coat protein Mr of GLRaV-3 and another 
grape closterovirus-like designated GLRaV-1 are larger than 
the 22-28K coat protein range reported for other well 
characterized enteroviruses such as BYV, CTV, and LIYV 
(Agranovsky (1991); Bar-Joseph et al., "enteroviruses, - 
cmt /ARB . No. 260 (1982), Klaassen et al., ,T HP n ■ Viro l., 
75:1525-1533 (1994); (Martelli et al . , -enterovirus, 
Classification and Nomenclature of Viruses, Fifth Report of 
the International Committee on Taxonomy of Viruses," in 
irrHvP. of v^ninav 9iiT>n1 eiw»nttim 2, Martelli et al., eds., 
New York: Springer-Verlag Wein, pp. 345-347 (1991); Sekiya et 
al., lT r.~r, virol. . 72:1013-1020 (1991)). Hu (1990) 
suggested a possible coat protein dimer. The present sequence 
data, however, do not support this suggestion. First, the 
size of the coat protein is 35K, which is smaller than what 
would be expected of a coat protein dimer. Second, a multiple 
sequence alignment of N-terminal half and C-terminal half of 
GLRaV-3 coat protein with the coat proteins of other 
enteroviruses showed that the filamentous virus coat protein 
consensus amino acid residues (S, R, and D) are only present 
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in the C-terminal portion, but not in the N-terminal portion 
of the coat protein. 

Example 7 - Primer Selection. 
5 Primers were selected based on the nucleotide sequence of 

clone pC4 which had been shown to hybridize to GLRaV-3 dsRNAs 
on a Northern hybridization (Figure 1) . The 648 bp sequence 
amplified by PCR was identified as nucleotides 9,364 to 10,011 
of the incomplete GLRaV-3 genome (Table 4) . This sequence 

10 fragment encodes a short peptide which shows some degree of 
amino acid sequence similarity to heat shock protein 90 
(HSP90) homologues of other closteroviruses, BYV, CTV, and 
LIYV (Table 5) . Two sets of primer sequences and their 
designations (external, 93-110 & 92-98, and internal, 93-25 & 

15 93-40) are shown in Table 15. Effectiveness of synthesized 
primers to amplify the expected PCR product was first 
evaluated on its respective cDNA clone, pC4 (Figure 6, lane 
11) - 



20 Example 8 - Development of a Simple and Effective PCR Sampl e 

Preparation, 

Initially purified dsRNA was used in a RT-PCR reaction. 
The expected 219 bp PCR product was consistently observed with 
the internal set of primers (Figure 6, lane 10) . To test 

25 whether or not these primers derived from GLRaV-3 specific 
dsRNA sequence is in fact the GLRaV-3 genome sequence, RNA 
extracted from a highly purified virus preparation was 
included in an assay. As expected, PCR products with similar 
size (219 bp) were observed in cloned plasmid DNA (pC4) 

30 (Figure 6, lane 11), dsRNA (Figure 6, lane 10) as well as 

purified viral RNA (Figure 6, lane 9) . This PCR result was 
the first evidence that dsRNA isolated from leaf roll-infected 
tissue was derived from the GLRaV-3 genome. However, PCR 
sample preparations from the purified virus procedure are too 

35 complicated to be used for leafroll diagnosis. Simplification 
sample preparations used viral RNA extracted from a partially 
purified virus preparation. This partially purified virus 
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preparation was again shown to be effective in RT-PCR (Figure 
6) . Sensitivity of RT-PCR was further evaluated with 10- fold 
serial dilution (up to 1CT 5 > of a sample. The expected PCR 
product of 219 bp in a partially purified virus preparation 
was observable up to the 1CT 3 dilution (Figure 6, lane 4). 
Although RT-PCR was shown again to work with partially 
purified virus preparations, this method of sample preparation 
was still too complicated to be used in a routine disease 
diagnosis. Over 10 attempts to directly use crude extract for 
RT-PCR were unsuccessful. Proteinase K-treated crude extract 
was by far the most simple and still effective pretreatment 
for RT-PCR. Therefore, the proteinase K-treated crude extract 
was used to evaluate RT-PCR for its ability to detect GLRaV-3 . 



Pvaniplo Q - RT-PCR 

With proteinase K-treated crude extract prepared from 
scraped phloem tissue collected from a typical leafroll 
infected vine (Doolittle's vineyard, New York), a PCR product 
of 219 bp was readily observed. However, application of this 
sample preparation method to other field collected samples 
(USDA, PGRU, Geneva, NY) was disappointing. With different 
batches of sample preparations, a range of 3 to 10 out of 12 
ELISA positive samples were shown to have the expected PCR 
products. To determine whether these inconsistent results 
were due to some kind of enzyme (reverse transcriptase or Taq 
DNA polymerase) inhibition present in the proteinase K-treated 
crude extract, increasing amounts of a sample were added into 
an aliquot of 100 fil PCR reaction mixture. PCR products of 
219 bp were readily observed from samples of 0.1 /il (lane 1) 
and 1 pi (lane 2) but not from 10 fil. Presumably, sufficient 
amount of enzyme inhibitors was present in the .10 /il of this 
sample. 

T?yam r tft 1 ft - Tmmunn-ranMire flT-PCR 

The immuno- capture method further simplified sample 
preparation by directly using crude extracts that were 
prepared in the standard ELISA extraction buffer. Immuno- 
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capture RT-PCR (IC RT-PCR} tests were initially performed with 
the internal primer set, and the expected PCR product of 219 
bp was observed from a typical leaf roll infected sample. 
However, this PCR method to test a range of field collected 
5 ELISA positive samples gave inconsistent results. In a PCR 

test performed with the external primer set, only five out of 
seven field collected ELISA positive samples were shown to 
amplify the expected PCR product {648 bp) (Figure 7A) . 
Meanwhile, the expected PCR product was consistently observed 
10 in dsRNA (Figure 7A, lane 10} , but such product was never 

observed in the healthy control (Figure 7A f lane 9) . In this 
case, however, the expected PCR product was not observed in a 
sample prepared using proteinase K- treated crude extract 
(Figure 7A, lane 8) . 

15 

Example 11 - Nested PCR 

As described above, inconsistency of RT-PCR was 
experienced with samples prepared either by the proteinase K- 
treated or by the immuno -capture methods. If this PCR 

20 technique is to be used in disease diagnosis, a consistent and 
reproducible result is needed. Thus, the Nested PCR method 
was introduced. Although an expected PCR product of 648 bp 
from the first PCR amplification with the external primer set 
was not always observable (Figure 7A) , in a Nested PCR 

25 amplification with the internal primer set, the expected 219 
bp PCR product was consistently observed from all seven ELISA 
positive samples (Figure 7B) . These products were observed in 
dsRNA (Figure 7B, lane 10) and in the proteinase K- treated 
crude extract (Figure 7B, lane 8) but not in a healthy control 

30 (Figure 7B, lane 9) . To determine the sensitivity of Nested 

PCR with samples prepared either by proteinase K-treated or by 
immuno -capture methods, Nested PCR and ELISA were performed 
simultaneously with samples prepared from a 10-fold dilution 
series. The sensitivity of Nested PCR was shown to be 10* 5 in 

35 proteinase K-treated crude extract (Figure 8A) , and was more 
than 10" 8 (the highest dilution point in this test) in an 
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immuno-capture preparation (Figure 8B) . With similar sample 
preparations, sensitivity for ELISA was only 10' 2 . 

r^mpT* 13 - Va ldai-inn of *H? with FTJSA fmri ^ndftXinq 

5 To determine whether the PCR-based GLRaV-3 detection 

method described in this study has a practical application in 
grapevine leaf roll disease diagnosis, a validation experiment 
with plants characterized thoroughly by ELISA and indexing is 
necessary. Several grapevines collected at USDA-PGRU at 

10 Geneva, New York, which have been well characterized by 3 -year 
biological indexing and by ELISA were selected for validation 
tests. A perfect correlation was observed between ELISA 
positive and PCR positive samples, although there was some 
discrepancy over indexing which suggested that other types of 

15 closteroviruses may also be involved in the grapevine leafroll 
disease (Table 7) . 

PCR technology has been applied to detect viruses, 
viroids and phytoplasmas in the field of plant pathology (Levy 
et al., ^irmal nf Vn^nlncHral Methods, 49:295-304 (1994)). 

20 Because of the presence of enzyme inhibitors (reverse 
transcriptase and/or Taq DNA polymerase) in many plant 
tissues, a lengthy and complicated procedure is usually 
required to prepare a sample for PCR. In studies of PCR 
detection of grapevine fanleaf virus, Rowhani et al., 

25 Phytopathology . 83:749-753 (1993), have already observed an 
enzyme inhibitory phenomenon. Substances including phenolic 
compounds and polysaccharides in grapevine tissues were 
suggested to be involved in enzyme inhibition. 

One of the objectives in the present study was to develop 

30 a sound practical procedure for sample preparation to 

eliminate this inhibitory problem for PCR detection of GLRaV-3 
in grapevine tissues. Although the expected PCR product was 
consistently observed from samples of dsRNA, purified virus 
and partial purified virus, proteinase K-treated crude extract 

35 and immuno-capture methods were the simplest and were still 
effective. Samples prepared with proteinase K-treated crude 
extract have an advantage over others in that hazardous 
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organic solvents, such as phenol and chloroform, are avoided. 
However, care must be taken in the sample concentration 
because the reaction can be inhibited by adding too much 
grapevine tissue. Minafra et al., J. Virol. Methods , 47 : 175- 
5 188 (1994) , reported the successful PCR detection of grapevine 
virus A, grapevine virus B, and GLRaV-3 with crude saps 
prepared from infected grapevine tissues, this method of 
sample preparation was, however, not effective in the present 
study. The similar primers used by Minafra (1994) , were, 

10 however, able to amplify the expected size of PCR products 
from dsRNA of the NY1 isolate of GLRaV-3. 

Immuno- capture is another simple and efficient method of 
sample preparation (Wetzel (1992) , which is hereby 
incorporated by reference) . First, crude EL ISA extracts can 

15 be used directly for RT-PCR. Second, it provides not only a 
definitive answer, but may also be an indication to a virus 
serotype. Third, with an immuno -capture step, virus particles 
are trapped by an antibody, and inhibitory substances may be 
washed away. Nested PCR with samples prepared by the immuno- 

20 capture method is 10 3 times more sensitive than with samples 

prepared by proteinase K- treated crude extract. However, this 
approach requires a virus specific antibody. For some newly 
discovered or hard to purify viruses, a virus specific 
antibody might not be available. There are at least six 

25 serologically distinctive closteroviruses associated with 
grapevine leafroll disease (Boscia (1995)). 

Example 13 - Nucle otide Sequence and Open Reading Frames 

A lambda ZAPII library was prepared from cDNA that was 

30 synthesized with random primed, reverse transcription of 

GLRaV-3 specific dsRNA. Initially, white/blue color selection 
in IPTG/X-gal containing plates was used to estimate the ratio 
of recombination. There were 15.7% white plaques, or an 
estimated 7 X 10 4 GLRaV-3 specific recombinants in this cDNA 

35 library. The library was screened with probes prepared from 
UNI-AMP™ PCR-amplified GLRaV-3 cDNA. More than 300 clones 
with inserts of up to 3 kb were selected after screening the 
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cDNA library with probe prepared from UNI -AMP™ PCR-amplif ied 
GLRaV-3 cDNA. In Northern blot hybridization, a probe 
prepared from the cloned insert of pC4, reacted strongly to 
the 16 kb dsRNA as well as to several other smaller Mr dsRNAs , 
No hybridization with nucleic acids from healthy grape or to 
dsRNA of CTV was observed (Figure 1) . 

Clone pB3-l was selected and sequenced after screening 
the library with HSP70 degenerate primer 

(5 ' GGIGGIGGIACITTYGAYGTITCI (SEQ ID NO:25)). Other clones 
that were chosen for nucleotide sequencing were selected by 
the clone walking strategy. The nucleotide sequencing 
strategy employed was based on terminal sequencing of random 
selected clones assisted with GCG fragment assembly program to 
assemble and extend the sequence. The step-by-step primer 
extension method was used to sequence the internal region of a 
selected clone. A total of 54 clones were selected for 
sequencing. Among them, 16 clones were completely sequenced 
on both DNA strands (Figure 9) . 

A total of 15,227 nucleotides were sequenced; nine open 
reading frames (ORFs) were identified designated as ORFs la, 
lb, and 2 to 8, The sequenced region was estimated to cover 
about 80% of the complete GLRaV-3 genome. Major genetic 
components, such as helicase (ORF la) , RdRp (ORF lb) , HSP70 
homologue (ORF 4) , HSP90 hornologue (ORF 5) and coat protein 
(ORF 6) were identified. 

ORF la was an incomplete ORF from which the 5 ' terminal 
portion has yet to be cloned and sequenced. The sequenced 
region presented in Figure 10 and Table 4 represents 
approximately two- thirds of the expected ORF la , as compared 
to the ORF la from BYV, CTV, and LIYV. The partial ORF la was 
terminated by the UGA stop codon at positions 4165-4167; the 
respective product consisted of 13 88 amino acid residues and 
had a deduced Mr of 148,603. Database searching indicated 
that the C-terminal portion of this protein shared significant 
similarity with the Superfamily 1 helicase of positive-strand 
RNA viruses. Comparison of the conserved domain region (291 
amino acids) showed a 3 8.4% identity with an additional 19.7% 
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similarity between GLRaV-3 and BYV and a 32.4% identity with 
an additional 21.1% similarity between GLRaV-3 and LIYV (Table 
6) ■ Six helicase conserved motifs of Superfamily 1 helicase 
of positive-strand RNA viruses (Hodgman, Nature , 333:22-23 
5 (Erratum 578) (1988); Koonin et al., Crit. Rev. Rinrhpni, 

Molec . Biol . . 28:375-430 (1993)) were also retained in GLRaV- 
3. Analysis of the predicted phylogenetic relationship in 
helicase domains between GLRaV-3 and the other positive-strand 
RNA viruses placed GLRaV-3 along with the other 

10 closteroviruses, including BYV, CTV, and LIYV, into the 

"tobamo" branch of the alphavirus-like supergroup (Figure 11, 
Table 5) . Table 5 compares the amino acid sequence alignment 
of the helicase of GLRaV-3 with respect to BYV, CTV, and LIYV. 
Consensus amino acid residues are shown. Uppercase letters 

15 indicate identical amino acids, lowercase letters indicate at 
least three identical or functionally similar amino acids. 
Six conserved motifs (I to VI) that are conserved among the 
Superfamily 1 helicase (Koonin et al., Crit. Rev, in Rior-h^ 
Moler. Biol . r 28:375-430 (1993)) of the positive-strand RNA 

2 0 viruses are overlined. 

ORF lb overlapped the last 113 nucleotides of ORF la and 
terminated at the UAG codon at positions 5780 to 5782. This 
ORF encodes a protein of 536 amino acid residues with a 
calculated Mr of 61,050 (Figures 10, Table 4). Database 
25 screening of this protein revealed significant similarity to 
the Supergroup 3 RdRp of the positive-strand RNA viruses. 
Sequence comparison of GLRaV-3 with BYV, LIYV, and CTV over a 
313 -amino acid sequence fragment revealed a striking amino 
acid sequence similarity among eight conserved motifs (Table 

3 0 8) • Consensus amino acid residues are shown. Uppercase 

letters indicate identical amino acids, and lowercase letters 
indicate at least three identical or functionally similar 
amino acids. The motifs (I to VIII) that are conserved among 
the Supergroup 3 RNA polymerase of positive-strand RNA viruses 
3 5 are overlined. The best alignment was with BYV, while the 
least alignment was with LIYV (Table 6) . Analysis of 
predicted phylogenetic relationships of the RdRp domains of 
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the alphavirus-like supergroup viruses again placed GLRaV-3 
into the tobamo branch along with other closteroviruses , BYV, 
CTV, BYSV, and LIYV (Figure 12) . 

Publications on BYV, CTV, and LIYV have proposed that ORF 
lb is expressed via a +1 ribosomal frameshift (Agranovsky 
(1994); Dolja et al., Ann. Rev. Phvtopafhnl 32:261-285 
(1994); Karasev (1995), and Klaassen (1995)). Direct 
nucleotide sequence comparison was performed within the 
ORFla/lb overlap of GLRaV-3 with respect to BYV, CTV, or LIYV. 
An apparently significant similarity was observed only to LIYV 
(Table 9), and not to BYV or CTV. The so-called "slippery" 
GGGUUU sequence and the stem-and-loop structure that were 
proposed to be involved in the BYV frameshift was absent from 
the GLRaV-3 ORFla/lb overlap. The predicted frameshift within 
the GLRaV-3 ORF la/lb overlap was selected based on an 
inspection of the C-terminal portion of the helicase alignment 
and the N-terminal portion of the RdRp alignment between 
GLRaV-3 and LIYV. 

Table 9 compares the aligned GLRaV-3 and LIYV nucleotide 
sequences (presented as DNA) in the vicinity of the proposed 
frameshift, nt 4099-4165 in GLRaV-3 and nt 5649-5715 in LIYV. 
Identical nucleotides are uppercase. LIYV predicted +1 
frameshift region (aAAG) and the corresponding GLRaV-3 (cACA) 
are bold and italic. The encoded C-terminus of HEL and N- 
terminus of RdRp are presented above (GLRaV-3) and below 
(LIYV) the nucleotide alignment. Repeat sequences are 
underlined. 

The GLRaV-3 ORF la/lb frameshift was predicted to occur in 
the homologous region of the LIYV genome, and was also 
preceded by a repeat sequence (GCTT) (Figure 24). Unlike 
LIYV, this repeat sequence was not a tandem repeat and was 
separated by one nucleotide (T) in GLRaV-3. 

The frameshift was predicted to occur at CACA (from His 
to Thr) in GLRaV-3 rather than slippery sequence AAAG in LIYV. 
However, additional experiments on in vitro expression of 
GLRaV-3 genomic RNA are needed in order to determine whether 
or not a large fusion protein is actually produced. 
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ORF 2 is predicted to encode a small peptide of 51 amino 
acids and a calculated Mr of 5,927. Database searching did 
not reveal any obvious protein matches within the existing 
Genbank (Release 84.0). 

Intergenic regions of 220 bp between ORF lb and ORF 2 and 
1065 bp between ORF 2 and ORF 3 were identified. There is no 
counterpart in the BYV or LIYV genomes; instead, an ORF of 33K 
in CTV (Karasev et al., iT v^nl . . 75:1415-1422 (1994)) 

or 32K in LIYV (Klaassen (1995) ) is observed over this similar 
region . 

ORF 3 encodes a small peptide of 45 amino acids and a 
calculated Afr of 5,090 (p5K) . Database searching revealed 
that it was most closely related to the small hydrophobic, 
transmembrane proteins of BYV (6.4K), CTV (6K) , and LIYV (5K) . 
Individual comparison (Table 3) showed that LIYV was its 
closest relative (45.8%) at the nucleotide level and BYV was 
the most homologous (30.4%) at the amino acid level. 

Table 10 compares the aligned amino acid sequences the 
small hydrophobic transmembrane protein of GLRaV-3 p5K with 
those of BYV (p6K) , CTV (p6K) , and LIYV (p5K) . Consensus 
amino acid residues are shown. Lowercase letters indicate at 
least three identical or functionally similar amino acids. 
The transmembrane domain that has been identified in several 
other enteroviruses, BYV, CTV, and LIYV (Karasev et al . , 
virology . 208:511-520 (1995)), is overlined. 

ORF 4 encodes a protein of 549 amino acids and a 
calculated Mr of 59,113 (p59) (Figure 10, Table 4). Database 
screening revealed significant similarity to the HSP70 family, 
the p65 protein of BYV, the p65 protein of CTV, and the p62 
protein of LIYV. A multiple amino acid sequence alignment of 
GLRaV-3 p59 with HSP70 analogs of other enteroviruses showed 
striking sequence similarity among eight conserved motifs (A- 
H) . Functionally important motifs (A-C) that are 
characteristic of all proteins containing the ATPase domain of 
the HSP70 type (Bork et al., P . ror Natl ftrflfl flfil . PSA , 
89:7290-7294 (1992)) were also preserved in GLRaV-3 p59, which 
suggested that this HSP70 chaperonin-like protein may also 
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possess ATPase activity on its N-ternunal domain and protein- 
protein interaction on its C- terminal domain (Dolja (1994) . 

Table 11 presents the amino acid sequence alignment of 
the HSP70 -related protein of GLRaV-3 (p59K) with those of BYV 
(p65K), CTV (p65K), and LIYV (p62K) . The eight conserved 
motifs (A to H) of cellular HSP70 are overlined. Consensus 
amino acid residues are shown. Uppercase letters indicate 
identical amino acids, and lowercase letters indicate at least 
three identical or functionally similar amino acids. 

Analysis of the predicted phylogenetic relationship of 
p59 of GLRaV-3 with HSP70-related proteins of other 
enteroviruses (BYV, CTV, and BYSV) and cellular HSP70s again 
placed the four enteroviruses together and the rest of the 
cellular HSP70S on the other branches (Figure 13). Although 
several closterovirus HSP70 -related proteins are closely 
related to each other and distant from other cellular members 
of this family, inspection of the phylogenetic tree indicates 
that GLRaV-3 may be an ancestral closterovirus relatively 
early in evolution as predicted by Dolja (1994) , because 
GLRaV-3 was placed in between enteroviruses and the other 
cellular HSP70 members. 

ORF 5 encodes a protein of 483 amino acids with a 
calculated Mr of 54,852 (p55) (Figure 10, Table 4). Table 12 
compares the alignment of the amino acid sequence deduced from 
the PCR fragment of GLRaV-3 with respective regions of HSP90 
homologues of beet yellow virus (BYV) ( P 64) , citrus tristeza 
virus (CTV) (p61), and lettuce infectious yellow virus (LIYV) 
(p59) . Consensus amino acid residues are shown. Uppercase 
letters indicate identical amino acids, lowercase letters 
indicate at least three identical or functionally similar 
amino acids. 

No significant sequence homology with other proteins was 
observed in the current database (GenBank, release 84.0). 
Direct comparison with other counterparts (p61 of CTV, p64 of 
BYV, and p59 of LIYV) of enteroviruses revealed some degree 
of amino acid sequence similarity, with 21.7% to BYV, 17.5% to 
CTV, and 16.7% to LIYV, respectively (Tables 6, 11, 12). Two 
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conserved regions of HSP90 previously described in BYV and CTV 
(Pappu (1994)) were identified in the p55 of GLRaV-3 (Table 
13) . 

ORF 6 encodes a protein of 313 amino acids with a 
5 calculated Mr of 34,866 <p35) (Figure 10 and Table 4). The 
fact that this ORF was encoded by three overlapping GLRaV-3 
immunpositive clones indicates that it contains the coat 
protein gene of GLRaV-3 . Alignment of the product of ORF 6 
(p35) with the coat protein sequences of BYV, CTV, and LIYV, 

10 is presented in Table 3 . The typical consensus amino acid 

residues (S, R, and D) of the coat protein of the filamentous 
plant viruses (Dolja (1991)), which may be involved in salt 
bridge formation and the proper folding of the most conserved 
core region (Boyko (1992)), were also retained in the p35 

15 (Table 3) . Individual sequence comparison showed the highest 
similarity to CTV (20.5%) and BYV (20.3%), and the lowest 
similarity to LIYV (17.8%). Analysis of predicted 
phylogenetic relationships with other filamentous plant 
viruses tentatively placed GLRaV-3 into a separate, but a 

20 closely related branch of closteroviruses (Figure 5) . 

ORF 7 encodes a protein of 477 amino acids with a 
calculated Mr of 53,104 (p53) (Figure 10 and Table 4) . Based 
on the presence of conserved, seuqences, this protein is 
designated as grapevine leafroll virus coat protein repeat 

25 (p53) . 

ORF 8 encodes an undefined polypeptide of a calculated Mr 
Of 21,248 (p21) . 

ORF 9 encodes an undefined protein of calculated Mr of 
19,588 (p20) . 

3 0 ORF 10 encodes an undefined polypeptide with a calculated 

Mr of 19,653 (p20) . 

ORF 11 encodes an undefined protein of calculated Mr of 
6963 (p7) . 

In the present study, many GLRaV-3 dsRNA specific cDNA 
35 clones were identified using a probe generated from UNI -AMP™ 
PCR-amplif ied cDNA. Using UNI -AMP™ adapters and primers 
(Clontech) in PCR has several advantages. First, it is not 
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necesS ary to know the nucleotide sequence of an amplified 
fragment. Second, cDNA can be amplified in sufficient amounts 
for specific probe preparation. In general, cDNA amplified by 
PCR using UNI -AMP™ primers and adapters could be used for 
5 cloning as well as a probe for screening of cDNA libraries. 
However, low abundance of the starting material and many 
cycles of PCR amplification often incorporate errors into the 
nucleotide sequence (Keohavong et al., PrPC. Natl, Acad. Sci , 
USA, 86:9253-9257 (1989); Saiki et al . , Science, 239:487-491 

10 (1988)) ♦ In the present study, only UNI-AMP™ PCR amplified 

cDNA was used as a probe for screening. The cDNA library was 
generated by direct cloning of the cDNA that was synthesized 
by AMV reverse transcriptase. Therefore, the cDNA cloned 
inserts are believed to more accurately reflect the actual 

15 sequence of the dsRNA and the genomic RNA of GLRaV-3 . 

A total of 15 , 227 nucleotides or about 80% of the 
estimated 16 kb GLRaV-3 dsRNA was cloned and sequenced. 
Identification of this sequence fragment as the GLRaV-3 genome 
was based on its sequence alignment with the coat protein gene 

20 of GLRaV-3 . This is the first direct evidence showing that 
high molecular weight dsRNA (-16 kb) isolated from GLRaV-3 
infected vines is derived from GLRaV-3 genomic RNA. Based 
upon the nine ORFs identified, the genome organization of 
GLRaV-3 bears significant similarity to the other 

25 closteroviruses sequenced (BYV, CTV, and LIYV) (Figure 10) . 

Dolja (1994) tentatively divided the closterovirus genome 
into four modules. For GLRaV-3, the 5' accessory module 
including protease and vector transmission factor is yet to be 
identified. The core module, including key domains in RNA 

30 replication machinery (MET-HEL-RdRp) that is conserved 

throughout the alphavirus supergroup, has been revealed in 
parts of the HEL and RdRp domains. The MET domain has not yet 
been identified for GLRaV-3. The chaperon module, including 
three ORFs coding for the small transmembrane protein, the 

3 5 HSP70 homologue, and the distantly related HSP90 homologue, 
has been fully sequenced. The last module includes coat 
protein and its possible diverged copy and is also preserved 
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in GLRaV-3. Overall similarity of the genome organization of 
GLRaV-3 with other closteroviruses further support the 
inclusion of GLRaV-3 as a member of closteroviruses (Hu (1990) 
and Martelli (1991) , which are hereby incorporated by 
reference) . However, observation of a predicted ambisense 
gene on its 3' terminal region may separate GLRaV-3 from other 
closteroviruses. Further comparative sequence analysis (Table 
3) as well as phylogenetic observation of GLRaV-3 with respect 
to other closteroviruses over the entire genome sequence 
region suggested that GLRaV-3 is most closely related to BYV, 
followed by CTV, and LIYV. 

As suggested by others (Agranovsky (1994), Dolja (1994), 
Karasev (1995), and Klaassen (1995)), expression of ORF lb in 
closteroviruses may be via a +1 ribosomal frameshift 
mechanism. In GLRaV-3, a potential translation frameshift of 
ORF lb could make a fusion HEL-RdRp protein of over 1,926 
amino acid residues with a capacity to encode a protein of 
more than 210K Comparative study of GLRaV-3 with respect to 
other closteroviruses over the ORF la/lb overlap revealed a 
significant sequence similarity to LIYV, but not to BYV or to 
CTV. The so-called slippery sequence (GGGUUU) and stem- loop 
and pseudoknot structures identified in BYV (Agranovsky 
(1994) , which is hereby incorporated by reference) is not 
present in GLRaV-3. Thus, a frameshift mechanism that is 
similar to LIYV may be employed for GLRaV-3. However, protein 
analysis is necessary in order to determine the protein 
encoding capacities of these ORFs. 

Differing from BYV, both CTV and LIYV have an extra ORF 
(ORF 2) in between RdRp (ORF lb) and the small membrane 
protein (ORF 3) and potentially encoding a protein of 33K or 
32K, respectively. However, in GLRaV-3, there is a much 
smaller ORF 2 (7K) followed by a long intergenic region of 
1065 bp. 

So far, among all plant viruses described, the HSP70 
related gene is present only in the closteroviruses (Dolja 
(1994)). Identification of the GLRaV-3 HSP70 gene was based 
on an assumption that this gene should also be present in the 
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closterovirus associated with grapevine leaf roll disease, 
specifically GLRaV- 3 . Thus, cDNA clones that reacted with 
HSP70 -degenerated primers were identified for sequence 
analysis. The identification of subsequent clones for 
5 sequencing was based on the gene-walking methodology. 

However, identification of immunopositive clones enabled 
identification of the coat protein gene of GLRaV- 3 and proved 
that the HSP70 -containing sequence fragment is present in the 
GLRaV- 3 RNA genome, 

10 The 16 kb dsRNA used for cDNA synthesis was assumed to be 

a virus replicative form (Hu (1990) . Selected clones have 
been shown by Northern hybridization to hybridize to the 16 kb 
dsRNA and several smaller RNAs (presumably subgenomic RNAs) 
(Figure 1) . Second, three GLRaV- 3 antibody- reacting clones 

15 were identified after immuno- screening of the protein 

expression library with both GLRaV -3 polyclonal (Zee (1987)) 
and monoclonal (Hu (1990)) antibodies. After nucleotide 
sequencing, these three antibody-reacting clones were shown to 
overlap one another and contain a common ORF which potentially 

20 encodes a protein with calculated Mr of 35K. This is 

consistent with the Mr estimated on SDS-PAGE (41K) . Third, 
analysis of the partial genome sequence of GLRaV- 3 suggested a 
close similarity in genome organization and gene sequences to 
the other closteroviruses (Dolja (1994)). 

25 Information regarding the genome of GLRaV- 3 provides a 

better understanding of this and related viruses and adds to 
the fundamental knowledge of closteroviruses. Present work on 
the nucleotide sequence and genome organization (about 80% of 
the estimated genome sequence) has provided direct evidence 

30 for a close relationship between GLRaV- 3 and other 

closteroviruses- It has also enabled, for the first time, the 
evaluation of phylogenetic relationships of GLRaV- 3 based on a 
wide range of genes and gene products (helicase, polymerase, 
HSP70 homologue, HSP90 homologue, and coat protein) . Based 

35 upon major differences in genome format and organization 

between BYV, CTV, and LIYV, along with phylogenetic analysis, 
Dolja (1994)) proposed the establishment of the new family 
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Closteroviridae with three new genera of Closterovirus (BYV) , 
Citrivirus (CTV) , and Biclovirus (LIYVJ . This work on genome 
organization and phylogenetic analysis, along with evidence 
that this virus is transmitted by mealybugs (see hereinabove) 
5 indicates that a new genus under Closteroviridae family should 
be established. Thus, GLRaV-3 (the NY1 isolate) is proposed 
as the type representative of the new genus, Graclovirus (gra- 
pevine clo-sterovirus) . Further sequencing of other grapevine 
leafroll associated closteroviruses may add more members to 

10 this genus. 

Another cDNA library of GLRaV-3 has been established 
recently from dsRNA of an Italian isolate of GLRaV-3 
(Saldarelli et al.. Plant- Pathology fQxford) , 43:91-96 (1994), 
which is hereby incorporated by rsference) . Selected clones 

15 react specifically to GLRaV-3 dsRNA on a Northern blot; 

however, no direct evidence was provided to suggest that those 
clones were indeed from GLRaV-3 genomic RNA. Meanwhile, a 
small piece of sequence information from one of those cDNA 
clones was used to synthesize primers for the development of a 

20 PCR detection method (Minafra (1994), which is hereby 

incorporated by reference) . Direct sequence comparison of 
these primer sequences to GLRaV-3 genome sequence obtained in 
the present study, showed that one of the primers (H229, 
5 1 ATAAGCATTCGGGATGGACC (SEQ ID NO: 27)) is located at 

25 nucleotides 5562-5581 and the other (C547, 

5 1 ATTAACtTgACGGATGGCACGC (SEQ ID NO: 28)) is in reverse 
direction and is the complement of nucleotides 5880-5901. 
Mismatching nucleotides between the primers and GLRaV-3 
sequence are shown in lowercase letters. Sequence comparison 

30 over these short primer regions to GLRaV-3 (isolate NY1) 

genome sequence showed a 90-95% identity, which suggested that 
these two isolates belong to the same virus (GLRaV-3) . 
Moreover, the primers prepared by Minafra (1994), which is 
hereby incorporated by reference, from the Italian isolate of 

35 GLRaV-3 produced an expected size of PCR product with 
templates prepared from the NY1 isolate of GLRaV-3 . 
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The remainder of the GLRaV-3 genome can be readily 
sequenced using the methods described herein and/or techniques 
well known to the art. 

Pvamp ip u - T^nH f -i^Hon anri rharacteri 7af i on of the 43 K 
ORE 

The complete nucleotide sequence of the GLRaV-3 HSP90- 
related gene" is given in Table 4. Initial sequencing work 
indicated that an open reading frame (ORF) encoding for a 
protein with a calculated Mr of 43K (Table 14) was downstream 
of the HSP70-related gene. This gene was selected for 
engineering because the size of its encoded product is similar 
to the GLRaV-3 coat protein gene. However, after sequence 
analysis, this incomplete ORF was located in the 3' terminal 
region of the HSP90-related gene. It is referred to herein as 
the incomplete GLRaV-3 HSP90 gene or as the 43K ORF. 

Pv^piP is - r^mm-pcF Knch peering Tnrompl TP GLRaV-3 
wgpgn nprf fnr v x nr**"* nn iri Plant" Tissues 

Two custom synthesized oligonucleotide primers, 5' primer 
(93-224, tacttatctagaaccATGGAAGCGAGTCGACGACTA (SEQ ID NO:29)) 
and 3' complementary primer (93-225, 

t c t tgagga t cca t ggAGAAACATCGTCGCATACTA (SEQ ID NO: 30)) that 
flank the 43K ORF were designed to amplify the incomplete 
HSP90 gene fragment by polymerase chain reaction. Addition of 
a restriction enzyme Ncol site in the primer facilitates 
cloning and protein expression (Table 15) (Slightom. Sana, 
100:251-255 (1991)). Using these primers, a product of the 
proper size (1.2 kb) was amplified by RT-PCR using GLRaV-3 
double- stranded RNA (dsRNA) as template. 

Table 14 shows the nucleotide sequence fragment 
containing the 43 kDa open reading frame that used to engineer 
the plant expression cassette, P BI525GLRaV-3hsp90. This 
fragment (nucleotides 9404 to 10,503 of the partial GLRaV-3 
genome sequence, Table 4) is located in the 3- portion of 
GLRaV-3 HSP90-related gene. Nucleotides shown in lower case 
facilitate cloning by adding Ncol restriction sites. 
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The PCR amplified product was treated with Ncol, isolated 
from a low-melting temperature agarose gel, and cloned into 
the same restriction enzyme treated binary vector pBI525 
(obtained from William Crosby, Plant Biotechnology Institute, 
Saskatoon, Sask., Canada), resulting in a clone pBI525GLRaV- 
3hsp90 (Figure 31) . A plant expression cassette, the EcoRI 
and Hindi I I fragment of clone pBI525GLRaV-3hsp90 # which 
contains properly engineered CaMV 35S promoters and a Nos 3' 
untranslated region, was excised and cloned into a similar 
restriction enzyme digested plant transformation vector, 
pBinl9 (Figure 14) (Clontech Laboratories, Inc.). Two clones, 
pBinl9GLRaV-3hsp90-12-3 and pBinl9GLRaV-3hsp90-12-4 that were 
shown by PCR to contain the proper size of the incomplete 
HSP90 gene were used to transform the avirulent A. 
tumefaciens, strain LBA4404 via electroporation (Bio-Rad) . 
The potentially transformed AgrroJbacteriurn was plated on 
selective media with 75 /zg/ml of kanamycin. Agrobacterium 
lines which contain the HSP90 gene sequence were used to 
transform tobacco (Nicotiana tobacum cv. Havana 423) using 
standard procedures (Horsch et al. # Science, 227:1229-1231 
(1985)). Kanamycin resistant tobacco plants were analyzed by 
PCR for the presence of the transgene. Transgenic tobacco 
plants with the transgene were self pollinated and seed was 
harvested. 

Trample i*7 - rns t nm-PCR Engineering of t . hft QRF 

The complete sequence of the GLRaV-3 hsp90 gene was 
reported in Table 4. However, in the present study, using two 
custom synthesized oligo primers {93-224, 

tact t ate tagaaccATGGAAGCGAGTCGACGACTA (SEQ ID NO:29) and 93- 
225, tcttgaggatccat gg AGAAACAT CGTCGCATACTA (SEQ ID NO:30)) and 
GLRaV-3 dsRNA as template, the incomplete HSP90 related gene 
sequence was amplified by RT-PCR which added an Ncol 
restriction enzyme recognition sequence (CCATGG) around the 
potential translation initiation codon (ATG) and another Ncol 
site, 29 nt downstream from the translation termination codon 
(TAA) (Table 14) . The PCR amplified fragment was digested 
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with Ncol, and cloned into the same restriction enzyme treated 
plant expression vector, pBI525. Under ampicillin selective 
conditions, hundreds of antibiotic resistant transformants of 
E. coli strain DH5a were generated. Clones derived from five 
5 colonies were selected for further analysis. Restriction 

enzyme mapping (Ncol or BamHI and EcoRV) showed that three out 
of five clones contained the proper size of the incomplete 
GLRaV-3 HSP90 sequence. Among them, two clones were 
engineered in the correct 5 1 -3 ' orientation with respect to 

10 the CaMV-AMV gene regulatory elements in the plant expression 
vector, pBI525. A graphical structure in the region of the 
plant expression cassette of clone P BI525GLRaV-3hsp90-12 is 
presented in Figure 14. 

The GLRaV-3 HSP90 expression cassette was removed from 

15 clone P BI525GLRaV-3hsp90-l2 by a complete digestion with 
Hindlll and EcoRI and cloned into the similar restriction 
enzyme treated plant transformation vector pBinl9. A clone 
designated as P Binl9GLRaV-3hsp90-12 was then obtained (Figure 
14) and was subsequently mobilized into the avirulent 

20 Agrobacterium strain LBA44 04 using a standard electroporation 
protocol (Bio-Rad) . Potentially transformed Agrobacterium 
cells were then plated on a selective medium (75 fzg/ml 
kanamycin) , and antibiotic resistant colonies were analyzed 
further by PCR with specific synthesized primers (93-224 and 

25 93-225) to see whether or not the incomplete HSP90 gene was 
still present. After analysis, clone LBA4404/pBinl9GLRaV- 
3hsp90-12 was selected and used to transform tobacco tissues. 

pv^Ip i7 - Tr*ngform a t-.i rm and Chara rfpri zation of Transgen i c 
30 Plants 

The genetically engineered A. tumefaciens strain, 
LBA4404/pBinl9GLRaV-3hsp90-12, was co-cultivated with tobacco 
leaf discs as described (Horsch (1985) ) , Potentially 
transformed tobacco tissues were selected on MS regeneration 
35 medium (Murashige et al., Phyqlnl ogia Planr . arum , 15:473-497 
(1962)) containing 300 /tg/ml of kanamycin. Numerous shoots 
developed from kanamycin resistant calli in about 6 weeks. 
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Rooted tobacco plants were obtained following growth of 
developed shoots on a rooting medium (MS without hormones) 
containing 300 /zg/ml of kanamycin. Eighteen independent, 
regenerated kanamycin resistant plants were transplanted in a 
5 greenhouse and tested for the presence of the HSP90-related 
gene by PCR. Fourteen out of eighteen selected kanamycin 
resistant putative transgenic lines were shown to contain a 
PCR product with the expected size of 1.2 kb. 

The genetically engineered AgroJbacterium tumefaciens 
10 strain LBA4404/pBinl9GLRaV-3hsp90-12 was also used to 

transfrom the grapevine rootstock C. 3309 (Vitia riparia x 
Vitis rupestris) . Embryogenic calli of C. 3309 were obtained 
by culturing anthers on MSE medium (Murashige and Skoog salts 
plus 0.2% sucrose, 1.1 mg/L 2,-4-D, and 0.2 mg/L BA. The 
15 medium was adjust to pH 6.5 and 0.8% Noble agar was added. 
After autoclaving 100 ml M-0654, 100 ml M-0529 and 1 ml 
vitamin M-3900 were added to the medium. Ater 60 days primary 
calli were indcued and transferred to hormone- free HMG medium 
(1/2 Murashige salts with 10 g/L sucrose, 4.6 g/L glycerol and 
20 0.8% Noble agar) for embryogenesis . Calli with globular or 
heart-shaped embryos were immersed for 15 min. in A. 
tumefaciens LBA4404/pBinl9GLRaV-3hsp90-12 suspended in MS 
liquid medium. The embryos were blotted on filter paper to 
remove excess liquid and transferred to HMG medium with 
25 acetosyringone (100 jiM) and kept for 48 hrs. in the dark at 
28°C. The calli were then washed 2-3 times in MS liquid 
medium plus cefotaxime (300 /zg/ml) and carbenicillin (200 
fig/ml) and transferred to HMG medium with the same antibiotics 
for 1-2 weeks. Subsequently, the embryogenic calli were 
30 transferred to HMG medium containing 20 or 20 fig /ml kanmamycin 
and 300 /xg/ml cefotaxime plus 200 jig/ml carbenicillin to 
select for transgenic embryos. After being on selective 
medium for 3-4 months, growing embryos were transferred to 
HMG, MGC (full -strength MS salts amended with 20 g/L sucrose, 
35 4.6 g/L glycerol, 1 g/L casein hydrolysate and 0.8% Noble 
agar), or MSE medium with kanamycin. After 4 months 
germinated embryos were transferred to baby food jars 
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containing rooting medium (Woody plant medium described by 
Lloyd and McCown (1981) "Commercially feasible 
micropropagation of mountain laurel, Kalmia latifola, by use 
of shoot tip culture," Proc. Intl. Plant Prop. Soc.211: 421-427 , 
supplemented with 0.1 mg/L BA, 3 g/L activated charcoal and 
1.5% sucrose. The pH was adjusted to 5.8 and Noble agar was 
added to 0.7%) . Plantlets with roots were transplatned to 
pots with artificial soil mix and grown in greenhouses. 88 
grapevine plants have cultivated, and several have been shown 
to contain the 43 kDa protein gene (by PCR) . 

Using the methods described above, engineering of the 
incomplete HSP90 gene of GLRaV-3 into plant expression and 
transformation vectors has been effected. The targeted gene 
sequence was shown to be integrated into the plant genome by 
PCR analysis of the putative transgenic tobacco plants. The 
engineered A. tumefaciens strain LBA4404/pBinl9GLRaV-3hsp90-12 
has been used to transform grapes and tobacco. Furthermore, 
this plant transformation vector can serve as a model for 
construction of other GLRaV-3 vectors, such as coat protein, 
RdRp, and HSP70, for which nucleotide sequences are disclosed 
herein. 

Since the first demonstration of transgenic tobacco 
plants expressing the coat protein gene of TMV resulted in 
resistance against TMV infection (Powell-Abel et al . , Science , 
232:738-743 (1986), which is hereby incorporated by 
reference) , the phenomenon of the coat protein-mediated 
protection has been observed for over 20 viruses in at least 
10 different taxonomic groups in a wide variety of 
dicotyledonous plant species (Beachy et al., flnmi ■ Rev . 
Phyi-nppt-hnT . . 28:451-74 (1990); Wilson, ?mr Nrltl ■ Acad ■ 
SH.. USA . 90:3134-3141 (1993)). If gene silencing (or co- 
suppression) (Finnegan et al., BWT»rtinQloqY . 12:883-888 
(1994); Flavell, ?mr NflM Prnfl firj , USA, 91:3490-3496 
(1994)) is one of the resistance mechanisms (Lindbo et al., 
Tho Plant- Ml . 5:1749-1759 (1993); Pang et al.. 
pWT^nnlnov . 11:819-824 (1993); Smith et al., Thf Plant 
Call, 6:1441-1453 (1994)), then one would expect to generate 
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transgenic plants expressing any part of a viral genome 
sequence to protect plants from that virus infection. Thus, 
in the present study, transgenic plants expressing the 43K ORF 
{or the incomplete hsp90 gene) are protected from GLRaV-3 
infection. 

Since tobacco {Nicotiana tobaccum cv. Havana 423) is not 
the host of GLRaV-3, direct evaluation of virus resistance was 
not possible in tobacco. However, after mechanical 
inoculation of N. benthamiana with grapevine leafroll infected 
tissue, Boscia (1995) have recovered a long closterovirus from 
N. benthamiana which is probably GLRaV-2. Thus, it is 
believed that other types of grapevine leafroll associated 
closteroviruses can also be mechanically transmitted to N. 
benthamiana. With transfer of an engineered 43K ORF from 
GLRaV-3 to W. benthamiana and to the grapevine root stock 
Couderc 3309, it is possible to evaluate the resistance of 
those plants against GLRaV-2 infection. 

gyampTe iff - rnaf Protei n -modi ated Protection and Other Forms 
of Pafh ngp^-HerivpH Resistance 

The successful engineering technique used in the above 
work can be utilized to engineer other gene sequences of 
GLRaV-3 which have since been identified. Among these, the 
coat protein gene of GLRaV-3 is the primary candidate since 
coat protein-mediated protection (Beachy (1990); Hull et al., 
C^f T?*v. Plant: Sci. , 11:17-33 (1992); Wilson (1993)) has 
been the most successful example in the application of the 
concept of pathogen-derived resistance (Sanford et al., jL_ 
Thftor. Biol. . 113:395-405 (1985)). Construction of plant 
expression vector (pEPT8/cpGLRaV-3) and Agrobacteriuni binary 
vector (pGA482pEPT8/cpGLRaV-3) was done following a strategy 
similar to the above. The GLRaV-3 coat protein gene was PCR 
amplified with primers (KSL95-5, 

actatttctagaaccATGGCATTTGAACTGAAATT (SEQ ID NO: 31) , and KSL95- 
6, ttctgaggatccat ggTATAAG CTCCCATGAATTAT (SEQ IDNO:32)) and 
cloned into pEPT8 after Ncol treatment. The expression 
cassette from pEPT8/cpGLRaV-3 (including double CaMV 35S 
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enhancers, 35S pro-tor. alfalfa mosaic virus leader sequence, 
GLRaV-3 coat protein gene, and 35S terminator) was digested 
with HindHI and cloned into P GA482G (Figure 15) . The 
: suiting bacteria, binary vector (pGA482G P EPTS cpGLRaV-3> 
was mobilized into A. tu.efaciens strain C58Z707 and used for 

' transformation of grapevines dependen t RNA 

Other gene sequences (e.g., ORF is, cne y 
polymerase) can also be used, as replicase-mediated protection 
has been effectively used to protect plants from virus 
infection (Carr et al., n^mrt Virol oqy , 4:339-347 
(1993); Golemboski et al., Lmr NflM ton* fH , USA , 
87-6311-6315 (1990) ) . The HSP70 homologue can also be used to 
generate transgenic plants that are resistant to all types of 
grapevine leafroll associated enteroviruses because 
significant sequence similarity is observed over HSP70 
conserved domains. Moreover, the phenomenon of RNA-mediated 
protection has also been observed (de Haan et al., 

, m 1111.1137 (1992); Farinelli et al., MO l.. 

Rir ^hnnloov. 10:1133-1137 

c ■7R4-292 (1993); Kawchuk et al. f 
?1n nt- Mirrr h" T "^ act -- 6:284-292 . 

rinm - (1991); Lindbo et al., 

nn1 pi,., MicrohP TnreraCt. 4:247-253 (1991), 

v^lncv . 189:725-733 (1992); Lindbo et al., ^ Plant 

r^oract . 5:144-153 (1992); Lindbo et al., *™™™ » i 
ZZZJ . 4:369-379 (1993); Pang (1993); Van Der Wxlk et al . , 
^!T^i W | Bl . , 17:431-440 (1991)). Thus, untranslatable 
trZcript versions of the above mentioned GLRaV- 3 genes also 
produce leafroll resistant transgenic plants. 

Another form of pathogen-derived resistance effective xn 
the control of plant viral disease is the use of antisense 
RNA Transgenic tobacco plants expressing the antisense 
sequence of the coat protein gene of cucumber mosaic virus 
(CMV, showed a delay in symptom expression by CMV infection 
Cuozzo et al., Ein^^ol^, 6:549-554 (1988,). Transgenic 
plants expressing either potato virus X (PVX) " at *™ ~ 
its antisense transcript were protected from infection by PVX. 
However, plants expressing antisense RNA were protectee only 
at low inoculum concentration. The extent of this protection 
mediated by antisense transcript is usually lower than 
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transgenic plants expressing the coat protein (Hemenway et 
al vumn J. . 7:1273-1280 (1988)). This type of resistance 
has' also been observed in bean yellow mosaic virus (Hammond et 
al., FY^hmnav. 81:1174 (1991), tobacco etch virus 
(Lindbo et al., Mrnl*tY. 189:725-733 (1992)) potato, vxrus Y 
(Farinelli (1993)), and zucchini yellow mosaic virus (Fang et 
al., „-] ? ^ MWohP Tnt^Ct., 6:358-367 (1993)). 
However, high level of resistance mediated by antisense 
sequence was observed to be similar to potato plants (Russet 
Burbank) expressing potato leafroll virus coat protein 
(Kawchuk (1991) ) . Besides using antisense transcrxpt of the 
virus coat protein gene, other virus genome sequences have 
also been demonstrated to be effective. These included the 
51-nucleotide sequences near the 5^ end of TMV RNA (Nelson et 
al fifiaa (Abst) , 127:227-232 (1993)) and noncoding region of 
turnip yellow mosaic virus genome (Zaccomer et al . , fifiaa, 87- 

94 (1993) ) . , . . 

GLRaV-3 has been shown to be transmitted by mealybugs and 
in some cases it has been shown to spread rapidly in vineyards 
(see hereinabove) . This disease will become more of a problem 
if mealybugs become resistant to insecticides or if 
insecticide use is restricted. Thus, the development of 
leafroll resistant grapevines is environmentally sound and 
good for the economics of grape growing. 

Although grapevine is a natural host of Agrobactaraun, (A. 
vitim is the causal agent of the grapevine crown gaU 
disease) , transformation of grapevine has proven difficult 
(Baribault et al., Fitrp . Bot., 41:1045-1049 (1990); 
Baribault et al., Elan^ll-*^ 3:137-140 (1989) ; Coloy 
et al., ^^^axt^i., 116:356-361 (1991 ; Gue^ecj, 
al ., Ti,^ Q.,an-^> 20:211-216 1990 ; Hebert 

et al., Plant Report 12:585-589 (1993) ; Le Gall et 

al.,Pl,nr BrltlM*. 102:161-170 (1994); Martinelli et al 
^ »n.1 . Gene tic. 88:621-628 (1994); Mullms et al . , 
^/^incv . 8:1041-1045 (1990)). Recently, an efficient 
regeneration system using proliferative somatic e^ryogenesis 
and subsequent plant development has been developed from 
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zygotic embryos of stenospermic seedless grapes (Mozsar, J. 
et al v^is. 33:245-246 (1994); Emerschad (1995)). Using 
this regeneration system, Scorza et al., Flint ^ 1 Resorts , 
14-589-592 (1995) succeeded in obtaining transgenic grapevines 
through zygotic-derived somatic embryos after parto.de- 
wounding/A. turcefaciens treatment. Using a Biolistic device, 
tiny embryos were shot with gold particles (1.0 (» a 
diameter) . The wounded embryos were then co-cultivated with 
A tumefaciens containing engineered plasmids carrying the 
selection marker of kanamycin resistance and P -glucuronidase 
(GUS) genes. Selection of transgenic grapevines was earned 
out with 20 M g/«1 kanamycin in the initial stage and then 40 
M g/ml for later proliferation. Small rooted seedlings were 
obtained from embryogenic culture within 5 months of 
bombardment/A. turcefaciens (Scorza (1995)). Transgenic 
grapevines were analyzed by PGR and Southern hybridization, 
and shown to carry the transgenes. The above-mentioned 
grapevine transformation approach has been carried out in the 
current investigation to generate transgenic grapevines 
expressing GLRaV-3 genes. Evaluation of any potential 
leafroll resistance on transgenic grapevines is carried out 
using insect vectors' or by grafting. 

fTn rr i- *i Q - Pr - inrf- - f ant-iH-^in, FnrnqnMTKT HLRaV3 

E. coli harboring the clone pCP10-l, which contains the 
major portion of the coat protein gene of GLRaV3 (Figure 4) , 
was used to express the coat protein and the P -galactosidase 
fusion protein. About 500 ml of LB medium containing 50 ^g/ml 
of ampicillin was inoculated with a single colony and 
incubated with rigorous shaking for overnight until log-phase 
growth. Expression of the fusion protein was induced by the 
addition of 1 mM IPTG. Bacteria were harvested by 
centrifugation at 5,000 rpm for 10 min. The bacterial 
envelope was broken by sonication. After low speed 
centrifugation to remove cell debris, the fusion protein was 
precipitated by the addition of saturated ammonium sulfate 
then resuspended in PBS buffer and electrophoresed m a SDS- 
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pclyacrylamide gel (SDS-PAGE) . The fusion protein band was 
excised after soaking the SDS-PAGE gel in 0.25M KC1 to locate 
the protein band. The protein was eluted with buffer (0.05M 
Tris-HCl, P H 7.9, 0.1% SDS, 0.1 mM EDT and 0.15 M NaCl) and 
precipitated by the addition of trichloroacetic acid to a 
final concentration of 20%. 

An antiserum was prepared by immunization of a rabbit 
with 0.5-1 mg of the purified protein emulsified with Freund-s 
completed adjuvant followed by two more weekly injections of 
0 5-1 mg protein emulsified with Freund's incomplete adjuvant. 
After the last injection, antiserum was prepared from blood 
collected from the rabbit every week for a period of 4 months. 

On Western blot analysis, the antibody gave a specific 
reaction to the 41K protein in GLRaV3 infected tissue as well 
as to the fusion protein itself (50K) and generated a pattern 
similar to the pattern seen in Figure 3. This antibody was 
also successfully used as a coating antibody and as an 
antibody-conjugate in an enzyme linked immunosorbent assay 

(ELISA) 

The above method of producing antibody to GLRaV3 can also 
be applied to other GLRaV-3 gene sequences of the present 
invention. The method affords a large amount of highly 
purified protein from E. call from which antibodies can be 
readily obtained. It is particularly useful in the common 
case where it is rather difficult to obtain sufficient amount 
of purified virus from GLRaV3 infected grapevine t»»u.s. 

Although the invention has been described in detail for 
the purpose of illustration, it is understood that ™* 
is solely for that purpose, and variations can be made there n 
by those skilled in the art without departing from ^ spirit 
and scope of the invention which is defined by the following 
claims . 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

,i, APPLICANT: Cornell Research Foundation, Inc. 

t oaf roll Virus Proteins and 
(ii) TITLE OF INVENTION : Grapevine Leaf roll 

Their Uses 
(iii) NUMBER OF SEQUENCES: 32 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE : Greenlee, Winner and Sullivan, P.C. 

(B) STREET : 5370 Manhattan Circle, Suite 201 

(C) CITY: Boulder 

(D) STATE: Colorado 

(E) COUNTRY: US 

(F) ZIP : 80303 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM : PC-DOS/MS-DOS 

, D , SOFTWARE : Patentln Release #1.0, Version #1.30 



(Vi) CURRENT APPLICATION DATA: 
{A) APPLICATION NUMBER: 
(B) FILING DATE: 
3Q (C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/009,008 

(B) FILING DATE: 20-DEC-1995 

(viii) ATTORNEY / AGENT INFORMATION : 

(A) NAME: Ferber, Donna M. 

(B) REGISTRATION NUMBER: 33,878 

(C) REFERENCE /DOCKET NUMBER: 99-96 ZA 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (303) 499-8080 

(B) TELEFAX : (303) 499-8089 
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(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4173 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

20 GTGTCTACTT ACGCGAAGAG TGTGATGAAC GACAATTTCA ATATCCTTGA GACCCTGGTA 60 

ACTTTGCCCA AGTCCTTTAT AGTCAAAGTA CCTGGTTCGG TGCTGGTTAG CATAACCACT 120 

TCGGGCATTT CCGACAAACT TGAACTTCGG GGCGCGTTCG ACGTTTCTAA AAAGAATTTC 180 

25 

TCCAGGAGGT TACGTTCGAG TCGTTTGCGC GTATTTTCTA GGGCTATTGT GGAGGATACG 240 

ATCAAGGTTA TGAAGGGCAT GAAATCAGAG GATGGTAAAC CACTCCCTAT AGCCGAGGAT 300 

3 0 TCCGTGTACG CGTTCATGAC AGGCAATATG TCAAACGTTC ATTGCACTAG GGCTGGTTTG 360 

CTCGGGGGCT CAAAGGCTTG CGCGGCTTCT TTAGCTGTGA AGGGTGCAGC TTCACGCGCT 42 0 

ACTGGAACAA AACTCTTTTC AGGTCTCACA TCCTTTCTTT CCGCCGGTGG TCTGTTTTAC 480 

35 

GATGAAGGCT TGACGCCCGG AGAGAGGCTT GATGCACTAA CGCGCCGTGA ACATGCTGTG 54 0 

AATTCACCTG TAGGCCTCTT AGAACCTGGA GCTTCGGTTG CGAAGCGGGT CGTTTCCGGA 600 

40 ACGAAAGCTT TTCTGTCAGA ATTGTCATTG GAGGACTTCA CCACTTTCGT CATAAAAAAT 660 

AGGGTGCTTA TTGGTGTTTT TACTCTTTCC ATGGCTCTCA CTCCGGTGGT CTGGAAGTAC 720 

AGAAGGAATA TCGCGCGAAC TGGCGTGGAT GTTTTCCACC GTGCTCGTTC GGGTACCGCG 780 

45 
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840 



900 



960 
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nrrTrGTTAG CTGGTGACGC TGCTCGTGGC 
GCCATCGGTT TACAATGTCT TAGTGGAGGA AGGTCGTTAG t 

MsrsCTATCT TCGGCGGTTG CGGTGACCAG AAATACAGTG 
GCGTTAACAG TGACTCGAGG AGGGCTATCT 

G CTAGGCGTC AGGTACCATT GGCGTTGCTT TCGTTTTCCA CGTCTTACGC AGTCAGTGGT 

TGCACTTTGT TAGGTATTTG GGCTCATGCT CTCCCTAGGC ATTTGATGTT CTTCTTTGGC 1020 

CTAGGGACGC TCTTCGGGGT GAGTGCCAGT ACCAATTCTT GGTCGCTTGG GGGCTATACG 1080 

AACAGTCTGT TCACCGTACC GGAATTAACT TGGGAAGGGA GGAGTTACAG ATCTTTATTG 1140 

CCCCAAGCAG CTTTAGGTAT TTCTCTCGTT GTGCGCGGGT TGTTAAGTGA AACTGTGCCA 1200 

CAACTAACGT ACGTACCGCC GATTGAAGGT CGGAATGTTT ATGATCAGGC ACTAAATTTT 1260 

TATCGCGACT TTGACTATGA CGATGGTGCA GGCCCATCCG GGACGGCTGG TCAAAGCGAT 1320 

CCTGGAACCA ATACTTCGGA TACTTCTTCG GTTTTCTCTG ACGATGGTTT GCCCGCTAGT 1380 

GGCGGTGGCT TCGACGCGCG CGTTGAGGCA GGTCCCAGCC ATGCTGTTGA TGAATCACCA 1440 

AGGGGTAGTG TTGAGTTCGT CTACAGAGAA CGTGTAGATG AACATCCGGC GTGTGGTGAA 1500 

GCTGAAGTTG AAAAGGATCT AATAACACCA CTTGGTACAG CTGTCTTAGA GTCGCCCCCC 1560 

GTAGGTCCTG AAGCTGGGAG CGCGCCCAAC GTCGAGGACG GTTGTCCGGA GGTTGAAGCT 1620 

GAGAAATGTT CGGAGGTCAT CGTTGACGTT CCTAGTTCAG AACCGCCGGT ACAAGAAGTC 1680 

CTTGAATCAA CCAATGGTGT CCAAGCTGCA AGAACTGAAG AGGTTGTGCA GGGCGACACA 1740 

TGTGGAGCTG GGGTAGCTAA ATCAGAAGTG AGTCAACGTG TGTTTCCTGC GCAAGTACCC 1800 

GCACATGAAG CTGGTCTTGA GGCATCTAGT GGCGCGGTCG TGGAGCCATT GCAAGTTTCT I860 

GTGCCAGTAG CCGTAGAGAA AACTGTTTTA TCTGTCGAGA AGGCGCGTGA GCTAAAGGCG 1920 

GTAGATAAGG GCAAGGCGGT CGTGCACGCA AAGGAAGTCA AGAATGTACC GGTTAAGACG 1980 

TTACCACGAG GGGCTCTAAA AATTAGTGAG GATACCGTTC GTAAGGAATT GTGCATGTTT 2040 

AGAACGTGTT CCTGCGGCGT GCAGTTGGAC GTGTACAATG AAGCGACCAT CGCCACTAGG 2100 

TTCTCAAATG CGTTTACCTT TGTCGATAGC TTGAAAGGGA GGAGTGCGGT CTTTTTCTCA 2160 
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AAGCTGGGTC AGGGGTATAC CTATAATGGT GGTAGCCATG TTTCATCAGG GTGGCCTCGT »» 

GCCCTAGAGG ATATCTTAAC QGCAATTAAG TACCCAAGCG TCTTCGACCA CTGTTTAGTG ».. 

S CAGAAGTACA AGATGGGTGG AGGCGTACCA TTCCACGCTG ATGACGAGGA GTGCTATCCA ,3,0 

TCAGATAACC CATC^AC GGTCAATCTC GTGGGGAAGG CAAACTTCTC GACTAAGTGC » M 

^^AGGGTG GTAAGGTCAT GGTCATAAAC GTAGCTTCGG GTGACTATTT TCTTATGCCT »« 
10 TGCGGTTTTC AAAGGACGCA CTTGCATTCA GTAAACTCCA TCGACGAAGG GCGCATCAGT 

TTGACGTTCA GGGCAACTCG GCGCGTCtTT GGTGTAGGCA GGATGTTGCA GTTAGCCGGC .... 

1S GGCGTGTCGG ATGAGAAGTC ACCAGGTGTT CCAAACCAQC AACCACAGAG CCAAGGTGCT U» 

ACCAGAACAA TCACACCAAA ATCGGGGGGC AAGGCTCTAT CIGAGGGAAG TGGTAGGGAA ,70. 
GTCAAGGGGA GGTCGACATA CTCGATATGG TGCGAACAAG ATTACGTTAG GAAGTGTGAG 
20 TCGCTCAGGG CTGATAATCC AGTGATGGCT CTTAAACCIG GCTACACCCC AATGACATTT 

^^A AAGCCGGGAC CTCTGAAGAT GCCGTCGTGG AGTAC^GAA GTATCTGGCT ,0=0 

fftGGGCGTTG CTTATGGCTA GAAATATTGC CGTCACTACC 2940 
25 ATAGGCATTG GGAGGACATA CAGGGCGTTO l-li 

GCCGAAGGTG TTCTGAAAGT ACCTAATCAA GTTTATGAAT CACTACCGGG CTTTCACGTT ,000 

TACAAGTCGG OCACAGATCT CATTTTTCAT TCAACACAAG ACGGCTTGCG TGTGAGAGAC 30*0 

30 CTACCGTACG TATTCATAGC TGAGAAAGGT ATTTTTATCA AGGGCAAAGA TGTCGACGCG ». 

GTAGTAGCTT TGGGCGACAA TCTGTCCGTA TGTGATGATA TATTGGTTTT CCATGATGCT 3,00 

35 ATTAATTTGA TGGGTGCACT GAAAGTTGCT CGATGTGGTA TGGTGGGTGA ATCATTTAAG 3,« 

TCGTTCGAAT ACAAATGCTA TAATGCTCCC ~CG GTAAGACGAC GATGCTAGTG 3300 

GCCACCATTA CGGCTAACGT GGGAAGTTCT 3360 



GACGAATTTG TCAAGTCACC CAATAGCACG 

ATTGTCAGGG GAATGTATAA AAGGGTTTTG 3480 



40 r , TCCGAATT TGGAAGGTCT CAACAGTGCT 3420 

GAGGACATAA ATATGGCGGT GAAGAAGAGA GATCCGAATT 



ACCACAGTTA ACTCCAGGGT GGTTAACTTT 
45 GTGGATGAGG TGTACATGAT GCATCAAGGC TTACTACAAC TAGGCGTCTT CGCAACCGGC 3S.0 
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GOT CGGAAG GCCTCTTTTT TGGAGACA.A »~ »^ CCGGGAGAAG 
GTGTTTAGGA ^TTGTGC TGTATTTGTT CCAAAGAAGG ««™r ATACACTTCT M .. 
^ACA GGTGTCCGTT AGATG^C TACTTGTTGT CCTCAATOAC CGTAAGGGGA 3„0 

cxgtccaaaa ggccaattgg aaccacgat — «™ »»»•« TC1CGTEr>C 3 " 0 



„«» GAAGTCGGAT ATGAAGAGGT CGTTGAAGGG AAAAGGAAAA 

GAAACACCAG TGATGACAQT GCATGAAGCA CAGGGAAAAA CATTCAGTGA TGTGGTATTG 
TTTAGGACGA AGAAAGCCGA TGACTCCCTA TKACTAAAC AACCGCATAT ACTTGTTGGT 
TTGTCGAGAC ACACACGCTC ACTCGTTTAT GCCGCTCTGA GCTCAGAGTT GGACGATAAG 

GTCGGCACAT ATATTAGCGA CGCGTCGCCT CAATCAQTAT CCGACGCTTT GCTTCACACG 

TTCGCCCCGG CTGGTTGCTT TCGAGGTATA TGA 
(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1390 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: YES 



3900 
3960 
4020 
4080 
4140 
4173 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

0 

V c 10 lb 

1 5 

Giu r* u * -» » - •« pm ne vai hyt :? pr ° oiy 
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Ser val Leu Val Ser He Thr Thr Ser Oly He Ser Asp Lys Leu Olu 
35 40 

50 « 60 

Arg ser Ser Arg Leu Arg Val Phe Ser Arg Ala Xle Val Olu Asp Thr 
65 70 75 
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Ile L ys val Met Lys Gly Met Lys Ser Olu Asp Gly Lys Pro Leu Pro 
85 90 

Ue Ala olu Asp ser Val Tyr Ala Phe Met Thr Oly Asn Met Ser Asn 

100 105 

val His Cys Thr Arg Ala Oly Leu Leu Oly Oly Ser Lys Ala Cys Ala 
115 120 " 

130 

Leu Phe Ser Gly Leu Thr Ser Phe Leu Ser Ala Oly Gly Leu Phe Tyr 
145 * 5 ° 

As p 01. Oly Leu Thr Pro Cly Olu Arg Leu Asp Ala Leu Thr Arg Arg 
165 170 15 

Olu His Ala Val Asn Ser Pro Val Oly Leu Leu Olu Pro Oly Ala Ser 
180 185 

, „ Val Val Ser cly Thr Lys Ala Phe Leu Ser Olu Leu 
Val Ala Lys Arg Val Val ser t,iy / 

195 200 205 

210 

Gly val Phe Thr Leu Ser Met Ala Leu Thr Pro Val Val Trp Lys Tyr 
225 230 

Arg A rg Asn He Ala Arg Thr Oly Val Asp Val Phe His Arg Ala Arg 

245 

Ser Oly Thr Ala Ala He Oly Leu Oln Cys Leu Ser Oly Oly Arg Ser 



260 265 
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. Ala Ala Arg Oly Ala Leu Thr Val Thr Arg Gly Gly 
Leu Ala Gly Asp Ala Ala Arg y 

275 280 

c xl. val Ala val Thr Arg Asn Thr val Ala Arg Arg Gin 

Leu Ser Ser Ala Val Aia v 

295 

290 

rrv,^ qer Tvr Ala Val Ser Gly 
VI Pro Leu Ala Leu Leu Ser Phe Ser Thr Ser Ty 

305 310 

, - *-\ a His Ala Leu Pro Arg His Leu Met 
C ys Thr Leu Leu Gly He Trp Ala His Al^ ^ 

325 

nv,o riv Val Ser Ala Ser Thr Asn 
Phe Phe Phe Gly Leu Gly Thr Leu Phe Gly 

340 345 

, alv Gly ^ Thr Asn Ser Leu Phe Thr Val Pro Glu 
Ser Trp Ser Leu Gly Gly Tyr in ^ 

355 360 

«. pr Tvr Arg Ser Leu Leu Pro Gin Ala Ala 
Le u Thr Trp Glu Gly Arg Ser Tyr Arg 

370 375 
Le u Gly ne Ser Leu Val Val Arg Cly Leu Leu Ser Glu Thr Val Pro 
385 390 

tio c\u GlV Arg Asn Val Tyr Asp Gin 
G ln Leu Thr Tyr Val Pro Pro lie Glu Gly 4l5 

405 



0 



55 



40 



. Phe Tvr Arg As P Phe Asp Tyr Asp Asp Gly Ala Gly Pro 

L,eu Asn Pne Tyr «xy ~f 43Q 

420 4 " 

r , v Thr Asn Thr Ser Asp Thr 
Gly Thr Ala Gly Gin Ser Asp Pro Gly ^ 



Ser 

435 440 

len rlv Leu pro Ala Ser Gly Gly Gly Phe 
Ser Ser Val Phe Ser Asp Asp Gly Leu 

450 

p A ia val Asp Glu Ser Pro 

Asp Ala Arg Val Glu Ala Gly Pro Ser H» Ala 

465 

„* v*i Tvr Arg Glu Arg Val Asp Glu His Pro 
Arg Gly Ser Val Glu Phe Val Tyr Arg ^ 

485 

, m,ws asp Leu lie Thr Pro Leu Gly 
Ala cys Gly Glu Ala Glu Val Glu Lys Asp 

_ b U b 

500 
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Thr Ala val Leu Glu Ser Pro Pro Val Gly Pro Glu Ala Gly Ser Ala 
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, , ni„ rv. Pro Glu Val Glu Ala Glu Lys Cys Ser 
Pro Asn Val Glu Asp Gly Cys Pro giu 

530 "5 "0 

Olu Val He Val Asp Val Pro Ser Ser Glu Pro Pro Val Gin Glu Val 
545 550 5 " 

~ n «=i rm Ala Ala Arg Thr Glu Glu Val Val 
Leu Glu Ser Thr Asn Gly Val Gin Ala Aia a 

565 570 " 5 

Oln Gly Asp Thr Cys Gly Ala Gly Val Ala Lys Ser Glu Val Ser Gin 
580 565 

" Arg Val Phe Pro Ala Gin Val Pro Ala His Glu Ala Gly Leu Glu Ala 

595 "0 "5 

Ser Ser Gly Ala Val Val Glu Pro Leu Gin Val Ser Val Pro Val Ala 
610 

„ al r i n t jV s Ala Arg Glu Leu Lys Ala 
Val Glu Lys Thr Val Leu Ser Val Glu Lys Ala g 

625 630 " 5 

val Asp Lys Gly Lys Ala Val Val His Ala Lys Glu Val Lys Asn Val 
645 650 

riv Ma Leu Lys lie Ser Glu Asp Thr 
Pro Val Lys Thr Leu Pro Arg Gly Ala Leu Ly 

660 665 60 

val Arg Lys Glu Leu Cys Met Phe Arg Thr Cys Ser Cys Gly Val Gin 
675 «" 

Leu Asp Val Tyr Asn Glu Ala Thr He Ala Thr Arg Phe Ser Asn Ala 
690 

Ti, e nv ara Ser Ala Val Phe Phe Ser 
Phe Thr Phe Val Asp Ser Leu Lys Gly Arg ser a ^ 

705 710 

Lys Leu Gly Glu Gly Tyr Thr Tyr Asn Gly Gly Ser His Val Ser 
725 730 

* a T *„ cm Asp He Leu Thr Ala He Lys Tyr Pro 
Gly Trp Pro Arg Ala Leu Glu Asp ij.«s 



Ser 
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W . E cvs wu val Gin Lys Tyr Lys Met Gly oly Oly 
Ser VaX Phe Asp His Cys Leu ^ 

760 

755 



20 



30 



35 



, is „. w «p «- ~ "! s " MP M " "° 



val Pro Phe His Ala - 780 

775 

770 



Ala Asn Phe Ser Thr Lys Cys 



,95 800 

785 790 



Ile Leu Thr Val Asn Leu Val Gly Lys -a » 
790 

val Met val He Asn Val Ala Ser Gly Asp Tyr 
10 Arg Lys Gly Gly Lys val Met va ^ 

x 805 

p « cvs Gly Phe Gin Arg Thr His Leu His Ser Val Asn 
Phe Leu Met Pro Cys Gly m ^ 830 

820 

15 Tlp ser Leu Thr Phe Arg Ala Thr Arg Arg 

Ser lie Asp Glu Gly Arg He Ser Le ^ 

835 640 
Val Phe Gly Val tj-ty 860 



8 55 
850 



&an Gin Gin Pro Gin Ser Gin Gly Ala 
Ol« Lys ser Pro Gly Val Pro Asn Gin ^ 

865 



e t- civ Gly Lys Ala Leu Ser Glu Gly 
885 

„. . ra Ser Thr Tyr Ser Ile Trp Cys Glu 
Glu Val Lys Gly Arg Ser Tnr y 

905 

900 

«i « w . c* «- «p « «» m - r. M " pr ° v " 



Gin Asp Tyr Val Arg 1.7- - g25 

« L «. ~ «v ~ * ~ - Tbt ~ oiu vai val 

935 

930 

v«l val Glu Tyr Leu Lys Tyr Leu Ala 
Ala Gly Thr Ser Glu Asp Ala Val Y 96Q 

945 950 

^ »«i Ala Leu Leu Met Ala Arg Asn Ile 
40 Ile Gly He Gly Arg Thr Tyr Arg Ala Le^ ^ 

965 

, „ i t*« lvs Val Pro Asn Gin Val Tyr 
Ala val Thr Thr Ala Glu Gly Val Leu Ly. 

985 

980 

45 
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Glu Ser Leu Pro Gly Phe His Val Tyr Lys Ser Gly Thr Asp Leu lie 
995 1000 1005 

Phe His Ser Thr Gin Asp Gly Leu Arg Val Arg Asp Leu Pro Tyr Val 
1010 1015 1020 

Phe lie Ala Glu Lys Gly He Phe He Lys Gly Lys Asp Val Asp Ala 
1025 1030 1035 1040 

Val val Ala Leu Gly Asp Asn Leu Ser Val Cys Asp Asp He Leu Val 
1045 1050 1055 

Phe His Asp Ala lie Asn Leu Met Gly Ala Leu Lys Val Ala Arg Cys 
1060 1065 1070 

Gly Met Val Gly Glu Ser Phe Lys Ser Phe Glu Tyr Lys Cys Tyr Asn 
1075 1080 1065 

Ala Pro Pro Gly Gly Gly Lys Thr Thr Met Leu Val Asp Glu Phe Val 
1090 1095 HOO 

Lys Ser Pro Asn Ser Thr Ala Thr He Thr Ala Asn Val Gly Ser Ser 
H05 mo 1115 1120 

Glu Asp He Asn Met Ala Val Lys Lys Arg Asp Pro Asn Leu Glu Gly 
H25 1130 H35 

Leu Asn Ser Ala Thr Thr Val Asn Ser Arg Val Val Asn Phe He Val 
1140 H45 "SO 

Arg Gly Met Tyr Lys Arg Val Leu Val Asp Glu Val Tyr Met Met His 
1155 1160 H65 

Gin Gly Leu Leu Gin Leu Gly Val Phe Ala Thr Gly Ala Ser Glu Gly 
1170 11" H80 

Leu Phe Phe Gly Asp He Asn Gin He Pro Phe He Asn Arg Glu Lys 
1185 1190 I 195 1200 

Val Phe Arg Met Asp Cys Ala Val Phe Val Pro Lys Lys Glu Ser Val 
1205 121° 1215 

Val Tyr Thr Ser Lys Ser Tyr Arg Cys Pro Leu Asp Val Cys Tyr Leu 
1220 1225 1230 
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1235 

, i« lvs Pro val val Arg Ser Leu Ser Lys Arg 
val val ser Qly Lys Asp Lys Pro 

1250 1255 

* *m Val Ala Glu He Asn Ala Asp Val Tyr 
Pro He Gly Thr Thr Asp Asp Val Ala ^ 

1265 



20 



30 



35 



40 



Thr «. «. W M. « « HP- «« « ~ ~ 



10 Leu Cys Met Thr bin « 1295 

1U 1285 " su 

v=i Met Thr Val His Glu Ala Gin Gly 
Oly Lys Gly Lys Glu Thr Pro Val Met ^r ^ 

15 „ , SD val val Leu Phe Arg Thr Lys Lys Ala Asp Asp 

Lys Thr Phe Ser Asp Val vai u 

1315 1320 

^ d™ His He Leu Val Gly Leu Ser Arg His 
Ser Leu Phe Thr Lys Gin Pro His ^ 



1330 



»i T«« qpr Ser Glu Leu Asp Asp Lys 
Thr Arg ser Leu Val Tyr Ala Ala Leu Ser Se 

1345 

« n» ser Pro Gin Ser Val Ser Asp Ala 
25 val Gly Thr Tyr He Ser Asp Ala Ser Pro ^ 

" 13G5 

Leu Leu His Thr Phe Ala Pro Ala Gly^cys Phe Arg Gly XU 



13B0 



(2) INFORMATION FOR SEQ ID N0:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1602 base pairs 

(B) TYPE: nucleic acid 

( C) STRANDEDNESS : double 

(D ) TOPOLOGY: not relevant 

MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(i v) ANTI- SENSE: NO 
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(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 3: 

ATGAATTTTG GACCGACCTT CGAAGGGGAG TTGGTACGGA AGATACCAAC AAGTCATTTT 60 

GTAGCCGTGA ATGGGTTTCT CGAGGACTTA CTCGACGGTT GTCCGGCTTT CGACTATGAC 120 

TTCTTTGAGG ATGATTTCGA AACTTCAGAT CAGTCTTTCC TCATAGAAGA TGTGCGCATT 180 

TCTGAATCTT TTTCTCATTT TGCGTCGAAA ATAGAGGATA GGTTTTACAG TTTTATTAGG 240 

TCTAGCGTAG GTTTACCAAA GCGCAACACC TTGAAGTGTA ACCTCGTCAC GTTTGAAAAT 300 

AGGAATTCCA ACGCCGATCG CGGTTGTAAC GTGGGTTGTG ACGACTCTGT GGCGCATGAA 360 

15 CTGAAGGAGA TTTTCTTCGA GGAGGTCGTT AACAAAGCTC GTTTAGCAGA GGTGACGGAA 420 

AGCCATTTGT CCAGCAACAC GATGTTGTTA TCAGATTGGT TGGACAAAAG GGCACCTAAC 480 

GCTTACAAGT CTCTCAAGCG GGCTTTAGGT TCGGTTGTCT TTCATCCGTC TATGTTGACG 54 0 

TCTTATACGC TCATGGTGAA AGCAGACGTA AAACCCAAGT TGGACAATAC GCCATTGTCG 600 

AAGTACGTAA CGGGGCAGAA TATAGTCTAC CACGATAGGT GCGTAACTGC GCTTTTTTCT 660 

25 TGCATTTTTA CTGCGTGCGT AGAGCGCTTA AAATACGTAG TGGACGAAAG GTGGCTCTTC 720 

TACCACGGGA TGGACACTGC GGAGTTGGCG GCTGCATTGA GGAACAATTT GGGGGACATC 780 
CGGCAATACT ACACCTATGA ACTGGATATC AGTAAGTACG ACAAATCTCA GAGTGCTCTC 84 0 

30 ATGAAGCAGG TGGAGGAGTT GATACTCTTG ACACTTGGTG TTGATAGAGA AGTTTTGTCT 900 

ACTTTCTTTT GTGGTGAGTA TGATAGCGTC GTGAGAACGA TGACGAAGGA ATTGGTGTTG 960 

35 TCTGTCGGCT CTCAGAGGCG CAGTGGTGGT GCTAACACGT GGTTGGGAAA TAGTTTAGTC 1020 

TTGTGCACCT TGTTGTCCGT AGTACTTAGG GGATTAGATT ATAGTTATAT TGTAGTTAGC 1080 

GGTGATGATA GCCTTATATT TAGTCGGCAG CCGTTGGATA TTGATACGTC GGTTCTGAGC 114 0 
40 GATAATTTTG GTTTTGACGT AAAGATTTTT AACCAAGCTG CTCCATATTT TTGTTCTAAG 1200 
TTTTTAGTTC AAGTCGAGGA TAGTCTCTTT TTTGTTCCCG ATCCACTTAA ACTCTTCGTT 1260 
4 5 AAGTTTGGAG CTTCCAAAAC TTCAGATATC GACCTTTTAC ATGAGATTTT TCAATCTTTC 1320 
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GTCGATCTTT CGAAGGGTTT CAATAGAGAG GACGTCATCC AGGAATTAGC TAAGCTGGTG 13 8 0 

ACGCGGAAAT ATAAGCATTC GGGATGGACC TACTCGGCTT TGTGTGTCTT GCACGTTTTA 1440 

AGTGCAAATT TTTCGCAGTT CTGTAGGTTA TATTACCACA ATAGCGTGAA TCTCGATGTG 1500 

CGCCCTATTC AGAGGACCGA GTCGCTTTCC TTGCTGGCCT TGAAGGCAAG AATTTTAAGG 1560 

TGGAAAGCTT CTCGTTTTGC CTTTTCGATA AAGAGGGGTT AA 1602 
(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 533 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: YES 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Asn Phe Gly Pro Thr Phe Glu Gly Glu Leu Val Arg Lys He Pro 
1 5 10 15 

Thr Ser His Phe Val Ala Val Asn Gly Phe Leu Glu Asp Leu Leu Asp 
20 25 30 

Gly Cys Pro Ala Phe Asp Tyr Asp Phe Phe Glu Asp Asp Phe Glu Thr 
35 40 45 

Ser Asp Gin Ser Phe Leu He Glu Asp Val Arg He Ser Glu Ser Phe 
50 55 60 

Ser His Phe Ala Ser Lys lie Glu Asp Arg Phe Tyr Ser Phe lie Arg 
65 70 75 80 

Ser Ser Val Gly Leu Pro Lys Arg Asn Thr Leu Lys Cys Asn Leu Val 
85 9° 95 
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Thr Phe Glu Asn Arg Asn Ser Asn Ala Asp Arg Gly Cys Asn Val Gly 
100 105 110 

Cys Asp Asp Ser Val Ala His Glu Leu Lys Glu He Phe Phe Glu Glu 
115 120 125 

Val Val Asn Lys Ala Arg Leu Ala Glu Val Thr Glu Ser His Leu Ser 
130 135 140 

Ser Asn Thr Met Leu Leu Ser Asp Trp Leu Asp Lys Arg Ala Pro Asn 
145 150 155 160 

Ala Tyr Lys Ser Leu Lys Arg Ala Leu Gly Ser Val Val Phe His Pro 
165 170 175 

Ser Met Leu Thr Ser Tyr Thr Leu Met Val Lys Ala Asp Val Lys Pro 
180 185 190 

Lys Leu Asp Asn Thr Pro Leu Ser Lys Tyr Val Thr Gly Gin Asn He 
195 200 205 

Val Tyr His Asp Arg Cys Val Thr Ala Leu Phe Ser Cys He Phe Thr 
210 215 220 

Ala Cys Val Glu Arg Leu Lys Tyr Val Val Asp Glu Arg Trp Leu Phe 
225 230 235 240 

Tyr His Gly Met Asp Thr Ala Glu Leu Ala Ala Ala Leu Arg Asn Asn 
245 250 255 

Leu Gly Asp He Arg Gin Tyr Tyr Thr Tyr Glu Leu Asp He Ser Lys 
260 265 270 

Tyr Asp Lys Ser Gin Ser Ala Leu Met Lys Gin Val Glu Glu Leu He 
275 280 285 

Leu Leu Thr Leu Gly Val Asp Arg Glu Val Leu Ser Thr Phe Phe Cys 
290 295 300 

Gly Glu Tyr Asp Ser Val Val Arg Thr Met Thr Lys Glu Leu Val Leu 
305 310 315 320 

Ser Val Gly Ser Gin Arg Arg Ser Gly Gly Ala Asn Thr Trp Leu Gly 
325 330 335 
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Asn Ser Leu Val Leu Cys Thr Leu Leu Ser Val Val Leu Arg Oly Leu 
340 345 

Asp Tyr Ser Tyr He Val Val Ser Oly Asp Asp Ser Leu lie Phe Ser 
355 360 3 " 

Gin Pro Leu Asp He Asp Thr Ser Val Leu Ser Asp Asn Phe Gly 
370 

Phe Asp Val Lys He Phe Asn Gin Ala Ala Pro Tyr Phe Cys Ser Lys 

385 jyU 

Phe Leu val Gin Val Glu Asp Ser Leu Phe Phe Val Pro Asp Pro Leu 
405 410 415 

Lys Leu Phe Val Lys Phe Gly Ala Ser Lys Thr Ser Asp lie Asp Leu 
420 425 430 

Leu His Glu lie Phe Gin Ser Phe Val Asp Leu Ser Lys Gly Phe Asn 
435 

^g Glu Asp Val He Gin Glu Leu Ala Lys Leu Val Thr Arg Lys Tyr 

450 455 
Lys His Ser Gly Trp Thr Tyr Ser Ala Leu Cys Val Leu His Val Leu 
465 470 

ser Ala Asn Phe Ser Gin Phe Cys Arg Leu Tyr Tyr His Asn Ser Val 
485 490 495 

A sn Leu Asp Val Arg Pro He Gin Arg Thr Glu Ser Leu Ser Leu Leu 

500 505 
A la Leu Lys Ala Arg He Leu Arg Trp Lys Ala Ser Arg Phe Ala Phe 

S20 525 
515 biI0 

Ser He Lys Arg Gly 
530 

2) INFORMATION FOR SEQ ID NO: 5: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1650 base pairs 

(B) TYPE: nucleic acid 
45 ( C ) STRANDEDNESS : double 
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(D) TOPOLOGY: not relevant 
(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
ATGGAAGTAG GTATAGATTT TGGAACCACT TTCAGCACAA TCTGCTTTTC CCCATCTGGG 
GTCAGCGGTT GTACTCCTGT GGCCGGTAGT GTTTACGTTG AAACCCAAAT TTTTATACCT 
GAAGGTAGCA GTACTTACTT AATTGGTAAA GCTGCGGGGA AAGCTTATCG TGACGGTGTA 
GAGGGAAGGT TGTATGTTAA CCCGAAAAGG TGGGCAGGTG TGACGAGGGA TAACGTCGAA 
CGCTACGTCG AGAAATTAAA AC CTAC ATAC ACCGTGAAGA TAGACAGCGG AGGCGCCTTA 
TTAATTGGAG GTTTAGGTTC CGGACCAGAC ACCTTATTGA GGGTCGTTGA CGTAATATGT 
TTATTCTTGA GAGCCTTGAT ACTGGAGTGC GAAAGGTATA CGTCTACGAC GGTTACAGCA 
GCTGTTGTAA CGGTACCGGC TGACTATAAC TCCTTTAAAC GAAGCTTCGT TGTTGAGGCG 
CTAAAAGGTC TTGGTATACC GGTTAGAGGT GTTGTTAACG AACCGACGGC CGCAGCCCTC 
TATTCCTTAG CTAAGTCGCG AGTAGAAGAC CTATTATTAG CGGTTTTTGA TTTTGGGGGA 
GGGACTTTCG ACGTCTCATT CGTTAAGAAG AAGGGAAATA TACTATGCGT CATCTTTTCA 
GTGGGTGATA ATTTCTTGGG TGGTAGAGAT ATTGATAGAG CTATCGTGGA AGTTATCAAA 
CAAAAGATCA AAGGAAAGGC GTCTGATGCC AAGTTAGGGA TATTCGTATC CTCGATGAAG 
GAAGACTTGT CTAACAATAA CGCTATAACG CAACACCTTA TCCCCGTAGA AGGGGGTGTG 
GAGGTTGTGG ATTTGACTAG CGACGAACTG GACGCAATCG TTGCACCATT CAGCGCTAGG 
GCTGTGGAAG TATTCAAAAC TGGTCTTGAC AACTTTTACC CAGACCCGGT TATTGCCGTT 



60 
120 
180 
240 
300 
360 
420 
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ATGACTGGGG GGTCAAGTGC TCTAGTTAAG GTCAGGAGTG ATGTGGCTAA TTTGCCGCAG ,0,0 

ATATCTAAAG TCGTGTTCGA CAGTACCGAT TTTAGATGTT CGGTGGCTTG TGGGGCTAAG IOBO 

5 GTTTACTGCG ATACTTTGGC AGGTAATAGC GGACTGAGAC TGGTGGACAC TTTAACGAAT 1». 

ACGCTAACGG ACGAGGTAGT GGGTCTTCAG CCGGTGGTAA TTTTCCCGAA AGGTAGTCCA 1200 

ATACCCTGTT CATATACTCA TAGATACACA GTGGGTGGTG GAGATGTGGT ATACGGTATA I2S0 

10 XTTGAAGGGG AGAATAACAG AGCTTTTCTA AATGAGCCGA CGTTCCGGGG CGTATCGAAA X320 

CGTAGGGGAG ACCCAGTAGA GACCGACGTG GCGCAGTTTA ATCTCTCCAC GGACGGAACG X380 

15 GTGTCTGTTA TCGTTAATGG TGAGGAAGTA AAGAATGAAT ATCTGGTACC CGGGACAACA X«0 

AACGTACTGG ATTCATTGGT CTATAAATCT GGGAGAGAAG ATTTAGAGGC TAAGGCAATA XS00 

CCAGAGTACT TGACCACACT GAATATTTTG CACGATAAGG CTTTCACGAG GAGAAACCTG 15*0 

20 GGTAACAAAG ATAAGGGGTT CTCGGATTTA AGGATAGAAG AAAATTTTTT AAAAXCCGCC 1*20 



GTAGATACAG ACACGATTTT GAATGGATAA 
25 (2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 549 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: not relevant 



30 



(ii> MOLECULE TYPE : protein 
35 (iii) HYPOTHETICAL: YES 



X650 



40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO 

Met Clu val Oly He Asp Phe Gly Thr Thr Phe Ser Tnr lie Cys Phe 

in * 3 



X 



45 



WO 97/22700 



82 



PCT/US96/20747 



Ser Pro Ser Gly Val Ser Gly Cys Thr Pro Val Ala Gly Ser Val Tyr 
20 25 30 

Val Glu Thr Gin He Phe He Pro Glu Gly Ser Ser Thr Tyr Leu He 
35 40 45 

Gly Lys Ala Ala Gly Lys Ala Tyr Arg Asp Gly Val Glu Gly Arg Leu 
50 55 €0 

Tyr Val Asn Pro Lys Arg Trp Ala Gly Val Thr Arg Asp Asn Val Glu 
65 70 75 80 

Arg Tyr Val Glu Lys Leu Lys Pro Thr Tyr Thr Val Lys He Asp Ser 



85 



90 



95 



Gly Gly Ala Leu Leu He Gly Gly Leu Gly Ser Gly Pro Asp Thr Leu 
100 105 HO 

Leu Arg Val Val Asp Val He Cys Leu Phe Leu Arg Ala Leu He Leu 



115 



120 



125 



Glu Cys Glu Arg Tyr Thr Ser Thr Thr Val Thr Ala Ala Val Val Thr 
130 135 140 

Val Pro Ala Asp Tyr Asn Ser Phe Lys Arg Ser Phe Val Val Glu Ala 
145 150 155 160 

Leu Lys Gly Leu Gly He Pro Val Arg Gly Val Val Asn Glu Pro Thr 
165 170 175 

Ala Ala Ala Leu Tyr Ser Leu Ala Lys Ser Arg Val Glu Asp Leu Leu 
180 185 190 



Leu 



Ala Val Phe Asp Phe Gly Gly Gly Thr Phe Asp Val Ser Phe Val 



195 



200 



205 



Lys Lys Lys Gly Asn He Leu Cys Val He Phe Ser Val Gly Asp Asn 



210 



215 



220 



Phe Leu Gly Gly Arg Asp He Asp Arg Ala He Val Glu Val He Lys 
225 230 235 240 

Gin Lys He Lys Gly Lys Ala Ser Asp Ala Lys Leu Gly lie Phe Val 



245 



250 



255 
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15 



20 



25 



30 



35 



40 



Ser Ser Met Lys 
260 



Glu Asp Leu Ser Asn Asn Asa Ala He Thr Gin His 



265 



270 



L eu lie Pro Val Glu Gly Gly Val Glu Val Val Asp Leu Thr Ser Asp 
275 280 

? oc 300 
290 29b 

Phe Lys Thr Gly Leu Asp Asn Phe Tyr Pro Asp Pro Val lie Ala Val 
305 

„et Thr Gly Gly Ser Ser Ala Leu Val Lys Val Arg Ser Asp Val Ala 
325 330 " 5 

A sn Leu Pro Gin lie Ser Lys Val Val Phe Asp Ser Thr Asp Phe Arg 
340 345 350 

C ys ser Val Ala Cys Gly Ala Lys Val Tyr Cys Asp Thr Leu Ala Gly 

355 360 

Asn ser Gly Leu Arg Leu Val Asp Thr Leu Thr Asn Thr Leu Thr Asp 

-> 7 c 380 
370 J/:> 

Gl » Val Val Gly Leu Gin Pro Val Val He Phe Pro Lys Gly Ser Pro 

385 390 

B nw Thr His Arg Tyr Thr Val Gly Gly Gly Asp Val 
He Pro Cys Ser Tyr Thr his Arg »y 

405 410 415 

val Tyr Gly He Phe Glu Gly Glu Asn Asn Arg Ala Phe Leu Asn Glu 
420 "5 43 ° 

435 

Asp val Ala Gin Phe Asn Leu Ser Thr Asp Gly Thr Val Ser Val lie 
450 455 

val Asn Gly Glu Glu Val Lys Asn Glu Tyr Leu Val Pro Gly Thr Thr 
465 470 4 " 

Asn val Leu Asp Ser Leu Val Tyr Lys Ser Gly Arg Glu Asp Leu Glu 

485 4yU 



45 
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Ala Lys Ala He Pro Glu Tyr Leu Thr Thr Leu Asn He Leu His Asp 
500 505 510 

Lys Ala Phe Thr Arg Arg Asn Leu Gly Asn Lys Asp Lys Gly Phe Ser 
5 515 520 525 

Asp Leu Arg He Glu Glu Asn Phe Leu Lys Ser Ala Val Asp Thr Asp 
530 535 540 

10 Thr He Leu Asn Gly 

545 

(2) INFORMATION FOR SEQ ID NO: 7: 



15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: not relevant 

20 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
25 (iv) ANTI-SENSE: NO 



30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

ATGGATAAAT ATATTTATGT AACGGGGATA TTAAACCCTA ACGAGGCTAG AGACGAGGTA 
TTCTCGGTAG TGAATAAGGG ATATATTGGA CCGGGAGGGC GCTCCTTTTC GAATCGTGGT 

35 

AGTAAGTACA CCGTCGTCTG GGAAAACTCT GCTGCGAGGA TTAGTGGATT TACGTCGACT 
TCGCAATCTA CGATAGATGC TTTCGCGTAT TTCTTGTTGA AAGGCGGATT GACTACCACG 
4 0 CTCTCTAACC CAATAAACTG TGAGAATTGG GTCAGGTCAT CTAAGGATTT AAGCGCGTTT 

TTCAGGACCC TAATTAAAGG TAAGATTTAT GCATCGCGTT CTGTGGACAG CAATCTTCCA 
AAGAAAGACA GGGATGACAT CATGGAAGCG AGTCGACGAC TATCGCCATC GGACGCCGCC 



45 
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10 



15 



25 



40 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 483 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 



45 (ii) MOLECULE TYPE: protein 



480 



540 

600 

660 

720 

780 

840 

900 

960 



TTTTG CAG AG CAGTGTCGGT TCAGGTAGGG AAGTATGTGG ACGTAACGCA GAATTTAGAA 
AGTACGATCG TGCCGTTAAG AGTTATGGAA ATAAAGAAAA GACGAGGATC AGCACATGTT 
AGTTTACCGA AGGTGGTATC CGCTTACGTA GATTTTTATA CGAACTTGCA GGAATTGCTG 
TCGGATGAAG TAACTAGGGC CAGAACCGAT ACAGTTTCGG CATACGCTAC CGACTCTATG 
GCTTTCTTAG TTAAGATGTT ACCCCTGACT GCTCGTGAGC AGTGGTTAAA AGACGTGCTA 
GGATATCTGC TGGTACGGAG ACGACCAGCA AATTTTTCCT ACGACGTAAG AGTAGCTTGG 
GTATATGACG TGATCGCTAC GCTCAAGCTG GTCATAAGAT TGTTTTTCAA CAAGGACACA 
CCCGGGGGTA TTAAAGACTT AAAACCGTGT GTGCCTATAG AGTCATTCGA CCCCTTTCAC 
GAGCTTTCGT CCTATTTCTC TAGGTTAAGT TACGAGATGA CGACAGGTAA AGGGGGAAAG 
ATATGCCCGG AGATCGCCGA GAAGTTGGTG CGCCGTCTAA TGGAGGAAAA CTATAAGTTA 1020 
20 AGATTGACCC CAGTGATGGC CTTAATAATT ATACTGGTAT ACTACTCCAT TTACGGCACA 1080 

AACGCTACCA GGATTAAAAG ACGCCCGGAT TTCCTCAATG TGAGGATAAA GGGAAGAGTC 1140 
GAGAAGGTTT CGTTACGGGG GGTAGAAGAT CGTGCCTTTA GAATATCAGA AAAGCGCGGG 1200 
ATAAACGCTC AACGTGTATT ATGTAGGTAC TATAGCGATC TCACATGTCT GGCTAGGCGA 1260 
CATTACGGCA TTCGCAGGAA CAATTGGAAG ACGCTGAGTT ATGTAGACGG GACGTTAGCG 1320 
30 TATGACACGG CTGATTGTAT AACTTCTAAG GTGAGAAATA CGATCAACAC CGCAGATCAC 1380 

GCTAGCATTA TACACTATAT CAAGACGAAC GAAAACCAGG TTACCGGAAC TACTCTACCA 1440 

35 CACCAGCTTT AA 

(2) INFORMATION FOR SEQ ID NO: 8: 



1452 



WO 97/22700 

(iii) HYPOTHETICAL: YES 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Asp Lys Tyr lie Tyr Val Thr Gly He Leu Asn Pro Asn Glu Ala 
! 5 10 15 

Arg Asp Glu Val Phe Ser Val Val Asn Lys Gly Tyr lie Gly Pro Gly 
20 25 30 

Gly Arg Ser Phe Ser Asn Arg Gly Ser Lys Tyr Thr Val Val Trp Glu 
35 40 45 



Asn 



Ser Ala Ala Arg He Ser Gly Phe Thr Ser Thr Ser Gin Ser Thr 



50 



55 60 



lie Asp Ala Phe Ala Tyr Phe Leu Leu Lys Gly Gly Leu Thr Thr Thr 
65 70 « 80 

Leu Ser Asn Pro He Asn Cys Glu Asn Trp Val Arg Ser Ser Lys Asp 
85 90 95 

Leu Ser Ala Phe Phe Arg Thr Leu lie Lys Gly Lys lie Tyr Ala Ser 
100 105 HO 

Arg Ser Val Asp Ser Asn Leu Pro Lys Lys Asp Arg Asp Asp He Met 
115 120 125 



Glu Ala Ser Arg Arg Leu Ser Pro Ser Asp Ala Ala Phe Cys Arg Ala 
130 



135 "0 



Val Ser Val Gin Val Gly Lys Tyr Val Asp Val Thr Gin Asn Leu Glu 
145 

Ser Thr lie Val Pro Leu Arg Val Met Glu lie Lys Lys Arg Arg Gly 
165 1™ 175 

Ser Ala His Val Ser Leu Pro Lys Val Val Ser Ala Tyr Val Asp Phe 
1B0 185 190 

Tyr Thr Asn Leu Gin Glu Leu Leu Ser Asp Glu Val Thr Arg Ala Arg 



195 



200 



205 
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Thr Asp Thr Val Ser Ala Tyr Ala Thr Asp Ser Met Ala Phe Leu Val 



210 



215 



220 



Lys Met Leu Pro Leu Thr Ala Arg Qlu Gin Trp Leu Lys Asp Val Leu 
,, Q 235 240 

225 230 



Gly Tyr Leu Leu Val Arg Arg Arg Pro Ala Asn Phe Ser Tyr Asp Val 
245 250 255 

Arg Val Ala Trp Val Tyr Asp Val lie Ala Thr Leu Lys Leu Val lie 



260 



265 



270 



Arg Leu 



Phe Phe Asn Lys Asp Thr Pro Gly Gly lie Lys Asp Leu Lys 



280 



285 



275 

Pro cys Val Pro lie Glu Ser Phe Asp Pro Phe His Glu Leu Ser Ser 
290 295 300 



Tyr Phe Ser Arg Leu Ser Tyr Glu Met Thr Thr Gly Lys Gly Gly Lys 

310 315 320 

305 310 

He Cys Pro Glu lie Ala Glu Lys Leu Val Arg Arg Leu Met Glu Glu 
325 330 335 

Asn Tyr Lys Leu Arg Leu Thr Pro Val Met Ala Leu lie lie He Leu 
340 345 350 

val Tyr Tyr Ser He Tyr Gly Thr Asn Ala Thr Arg lie Lys Arg Arg 
355 

Pro Asp Phe Leu Asn Val Arg lie Lys Gly Arg Val Glu Lys Val Ser 
370 

Leu Arg Gly Val Glu Asp Arg Ala Phe Arg He Ser Glu Lys Arg Gly 
, qo 395 400 

385 390 

lie Asn Ala Gin Arg Val Leu Cys Arg Tyr Tyr Ser Asp Leu Thr Cys 
405 410 415 

Leu Ala Arg Arg His Tyr Gly He Arg Arg Asn Asn Trp Lys Thr Leu 
420 425 430 

Ser Tyr Val Asp Gly Thr Leu Ala Tyr Asp Thr Ala Asp Cys lie Thr 



435 



440 



445 
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Ser Lys Val Arg Asn Thr lie Asn Thr Ala Asp His Ala Ser lie lie 
450 455 460 

His Tyr He Lys Thr Asn Glu Asn Gin Val Thr Gly Thr Thr Leu Pro 

465 470 475 480 



His Gin Leu 



10 (2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 942 base pairs 

(B) TYPE: nucleic acid 

1 5 (C) STRANDEDNESS : double 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: cDNA 

20 (iv) ANTI-SENSE: NO 



30 



25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 

ATGGCATTTG AACTGAAATT AGGGCAGATA TATGAAGTCG TCCCCGAAAA TAATTTGAGA 
GTTAGAGTGG GGGATGCGGC ACAAGGAAAA TTTAGTAAGG CGAGTTTCTT AAAGTACGTT 
AAGGACGGGA CACAGGCGGA ATTAACGGGA ATCGCCGTAG TCCCCGAAAA ATACGTATTC 
GCCACAGCAG CTTTGGCTAC AGCGGCGCAG GAGCCACCTA GGCAGCCACC AGCGCAAGTG 
3 5 GCGGAACCAC AGGAAACCGA TATAGGGGTA GTGCCGGAAT CTGAGACTCT CACACCAAAT 

AAGTTGGTTT TCGAGAAAGA TCCAGACAAG TTCTTGAAGA CTATGGGCAA GGGAATAGCT 
TTGGACTTGG CGGGAGTTAC CCACAAACCG AAAGTTATTA ACGAGCCAGG GAAAGTATCA 
40 GTAGAGGTGG CAATGAAGAT TAATGCCGCA TTGATGGAGC TGTGTAAGAA GGTTATGGGC 

GCCGATGACG CAGCAACTAA GACAGAATTC TTCTTGTACG TGATGCAGAT TGCTTGCACG 
4 5 TTCTTTACAT CGTCTTCGAC GGAGTTCAAA GAGTTTGACT ACATAGAAAC CGATGATGGA 
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AAGAAGATAT ATGCGGTGTG GGTATATGAT TGCATTAAAC AAGCTGCTGC TTCGACGGGT 660 
TATGAAAACC CGGTAAGGCA GTATCTAGCG TACTTCACAC CAACCTTCAT CACGGCGACC 720 
CTGAATGGTA AACTAGTGAT GAACGAGAAG GTTATGGCAC AGCATGGAGT ACCACCGAAA 780 
TTCTTTCCGT ACACGATAGA CTGCGTTCGT CCGACGTACG ATCTGTTCAA CAACGACGCA 
ATATTAGCAT GGAATTTAGC TAGACAGCAG GCGTTTAGAA ACAAGACGGT AACGGCCGAT 
AACACCTTAC ACAACGTCTT CCAACTATTG CAAAAGAAGT AG 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 313 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: YES 



840 



900 



942 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Ala Phe Glu Leu Lys Leu Gly Gin lie Tyr Glu Val Val Pro Glu 
3. 5 10 15 



Asn Asn Leu Arg Val Arg Val Gly Asp Ala Ala Gin Gly Lys Phe Ser 
20 25 30 

Lys Ala Ser Phe Leu Lys Tyr Val Lys Asp Gly Thr Gin Ala Glu Leu 



35 



40 45 



Thr Gly lie Ala Val Val Pro Glu Lys Tyr Val Phe Ala Thr Ala Ala 
50 

Leu Ala Thr Ala Ala Gin Glu Pro Pro Arg Gin Pro Pro Ala Gin Val 

65 
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Ala Glu Pro Gin Glu Thr Asp lie Gly Val Val Pro Glu Ser Glu Thr 
85 9° 95 

Leu Thr Pro Asn Lys Leu Val Phe Glu Lys Asp Pro Asp Lys Phe Leu 
100 105 HO 

Lys Thr Met Gly Lys Gly He Ala Leu Asp Leu Ala Gly Val Thr His 
115 120 125 

Lys Pro Lys Val lie Asn Glu Pro Gly Lys Val Ser Val Glu Val Ala 
130 135 "° 

Met Lys lie Asn Ala Ala Leu Met Glu Leu Cys Lys Lys Val Met Gly 
145 150 155 160 

Ala Asp Asp Ala Ala Thr Lys Thr Glu Phe Phe Leu Tyr Val Met Gin 
165 I 70 175 



lie Ala Cys Thr Phe Phe Thr Ser Ser Ser Thr Glu Phe Lys Glu Phe 
180 I" 19° 

Asp Tyr He Glu Thr Asp Asp Gly Lys Lys lie Tyr Ala Val Trp Val 
195 200 205 

Tyr Asp Cys lie Lys Gin Ala Ala Ala Ser Thr Gly Tyr Glu Asn Pro 
210 215 220 

Val Arg Gin Tyr Leu Ala Tyr Phe Thr Pro Thr Phe lie Thr Ala Thr 



225 



230 235 240 



Leu Asn Gly Lys Leu Val Met Asn Glu Lys Val Met Ala Gin His Gly 
245 250 255 



Val Pro Pro Lys Phe Phe Pro Tyr Thr lie Asp Cys Val Arg Pro Thr 
260 265 270 

Tyr Asp Leu Phe Asn Asn Asp Ala He Leu Ala Trp Asn Leu Ala Arg 



275 



280 285 



Gin Gin Ala Phe Arg Asn Lys Thr Val Thr Ala Asp Asn Thr Leu His 
290 295 300 

Asn Val Phe Gin Leu Leu Gin Lys Lys 
305 310 
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(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 156 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
ATGTACAGTA GAGGGTCTTT CTTTAAGTCT CGGGTTACCC TTCCTACTCT TGTCGGAGCA 
TACATGTGGG AGTTTGAACT CCCGTATCTT ACGGACAAGA GACACATCAG CTATAGCGCG 
CCAAGTGTCG CGACTTTTAG CCTTGTGTCG AGGTAG 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY : not relevant 

(ii) MOLECULE TYPE : protein 
(iii) HYPOTHETICAL: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:12: 

Met Tyr Ser Arg Gly Ser Phe Phe Lys Ser Arg Val Thr Leu Pro Thr 
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Leu Val Gly Ala Tyr Met Trp GIu Phe Glu Leu Pro Tyr Leu Thr Asp 
20 25 30 

Lys Arg His lie Ser Tyr Ser Ala Pro Ser Val Ala Thr Phe Ser Leu 
35 40 45 

Val Ser Arg 
50 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 138 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
ATGGATGATT TTAAACAGGC AATACTGTTG CTAGTAGTCG ATTTTGTCTT CGTGATAATT 60 
CTGCTGCTGG TTCTTACGTT CGTCGTCCCG AGGTTACAGC AAAGCTCCAC CATTAATACA 120 
GGTCTTAGGA CAGTGTGA 138 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 



(iii) HYPOTHETICAL: YES 
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{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Met Asp Asp Phe Lys Gin Ala He Leu Leu Leu Val Val Asp Phe Val 
15 10 15 

Phe Val He He Leu Leu Leu Val Leu Thr Phe Val Val Pro Arg Leu 
20 25 30 

Gin Gin Ser Ser Thr He Asn Thr Gly Leu Arg Thr Val 
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 15: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 34 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY; not relevant 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

ATGGGAGCTT ATACACATGT AGACTTTCAT GAGTCGCGGT TGCTGAAAGA CAAACAAGAC 60 

TATCTTTCTT TCAAGTCAGC GGATGAAGCT CCTCCTGATC CTCCCGGATA CGTTCGCCCA 120 

GATAGTTATG TGAGGGCTTA TTTGATACAA AGAGCAGACT TTCCCAATAC TCAAAGCTTA 180 

TCAGTTACGT TATCGATAGC CAGTAATAAG TTAGCTTCAG GTCTTATGGG AAGCGACGCA 240 

GTATCATCGT CGTTTATGCT GATGAACGAC GTGGGAGATT ACTTCGAGTG CGGCGTGTGT 3 00 

CACAACAAAC CCTACTTAGG ACGGGAAGTT ATCTTCTGTA GGAAATACAT AGGTGGGAGA 360 

GGAGTGGAGA TCACCACTGG TAAGAACTAC ACGTCGAACA ATTGGAACGA GGCGTCGTAC 42 0 



WO 97/22700 PCT/US96/20747 

94 

GTAATACAAG TGAACGTAGT CGATGGGTTA GCACAGACCA CTGTTAATTC TACTTATACG 4 80 

CAAACGGACG TTAGTGGTCT ACCCAAAAAT TGGACGCGTA TCTACAAAAT AACAAAGATA 540 

GTGTCCGTAG ATCAGAACCT CTACCCTGGT TGTTTCTCAG ACTCGAAACT GGGTGTAATG 600 

CGTATAAGGT CACTGTTAGT TTCCCCAGTG CGCATCTTCT TTAGGGATAT CTTATTGAAA 660 

CCTTTGAAGA AATCGTTCAA CGCAAGAATC GAGGATGTGC TGAATATTGA CGACACGTCG 720 

TTGTTAGTAC CGAGTCCTGT CGTACCAGAG TCTACGGGAG GTGTAGGTCC ATCAGAGCAG 780 

CTGGATGTAG TGGCTTTAAC GTCCGACGTA ACGGAATTGA TCAACACTAG GGGGCAAGGT 840 

AAGATATGTT TTCCAGACTC AGTGTTATCG ATCAATGAAG CGGATATCTA CGATGAGCGG 900 

TATTTGCCGA TAACGGAAGC TCTACAGATA AACGCAAGAC TACGCAGACT CGTTCTTTCG 960 

AAAGGCGGGA GTCAAACACC ACGAGATATG GGGAATATGA TAGTGGCCAT GATACAACTT 1020 

TTCGTACTCT ACTCTACTGT AAAGAATATA AGCGTCAAAG ACGGGTATAG GGTGGAGACC 1080 

GAATTAGGTC AAAAGAGAGT CTACTTAAGT TATTCGGAAG TAAGGGAAGC TATATTAGGA 114 0 

GGGAAATACG GTGCGTCTCC AACCAACACT GTGCGATCCT TCATGAGGTA TTTTGCTCAC 1200 

ACCACTATTA CTCTACTTAT AGAGAAGAAA ATTCAGCCAG CGTGTACTGC CCTAGCTAAG 1260 

CACGGCGTCC CGAAGAGGTT CACTCCGTAC TGCTTCGACT TCGCACTACT GGATAACAGA 1320 

TATTACCCGG CGGACGTGTT GAAGGCTAAC GCAATGGCTT GCGCTATAGC GATTAAATCA 1380 

GCTAATTTAA GGCGTAAAGG TTCGGAGACG TATAACATCT TAGAAAGCAT TTGA 1434 
(2) INFORMATION FOR SEQ ID NO: 16: 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 477 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: YES 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: 

Met Gly Ala Tyr Thr His Val Asp Phe His Glu Ser Arg Leu Leu Lys 
x 5 10 v. 15 

Asp Lys Gin Asp Tyr Leu Ser Phe Lys Ser Ala Asp Glu Ala Pro Pro 
20 25 30 

Asp Pro Pro Gly Tyr Val Arg Pro Asp Ser Tyr Val Arg Ala Tyr Leu 
35 40 45 

lie Gin Arg Ala Asp Phe Pro Asn Thr Gin Ser Leu Ser Val Thr Leu 
50 55 €0 

Ser He Ala Ser Asn Lys Leu Ala Ser Gly Leu Met Gly Ser Asp Ala 
65 70 75 80 

Val Ser Ser Ser Phe Met Leu Met Asn Asp Val Gly Asp Tyr Phe Glu 
85 90 95 

Cys Gly Val Cys His Asn Lys Pro Tyr Leu Gly Arg Glu Val He Phe 
100 105 HO 

Cys Arg Lys Tyr He Gly Gly Arg Gly Val Glu He Thr Thr Gly Lys 
115 120 125 

Asn Tyr Thr Ser Asn Asn Trp Asn Glu Ala Ser Tyr Val He Gin Val 
130 135 140 

Asn Val Val Asp Gly Leu Ala Gin Thr Thr Val Asn Ser Thr Tyr Thr 
145 150 155 160 

Gin Thr Asp Val Ser Gly Leu Pro Lys Asn Trp Thr Arg He Tyr Lys 
165 170 175 

He Thr Lys He Val Ser Val Asp Gin Asn Leu Tyr Pro Gly Cys Phe 
180 185 190 

Ser Asp Ser Lys Leu Gly Val Met Arg He Arg Ser Leu Leu Val Ser 
195 200 205 

Pro Val Arg He Phe Phe Arg Asp He Leu Leu Lys Pro Leu Lys Lys 
210 215 220 
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Ser Phe Asn Ala Arg lie Glu Asp Val Leu Asn lie Asp Asp Thr Ser 
225 230 235 240 

Leu Leu Val Pro Ser Pro Val Val Pro Glu Ser Thr Gly Gly Val Gly 
245 250 255 

Pro Ser Glu Gin Leu Asp Val Val Ala Leu Thr Ser Asp Val Thr Glu 
260 265 270 

Leu lie Asn Thr Arg Gly Gin Gly Lys lie Cys Phe Pro Asp Ser Val 
275 280 285 

Leu Ser He Asn Glu Ala Asp He Tyr Asp Glu Arg Tyr Leu Pro lie 
290 295 300 

Thr Glu Ala Leu Gin He Asn Ala Arg Leu Arg Arg Leu Val Leu Ser 
305 310 315 320 

Lys Gly Gly Ser Gin Thr Pro Arg Asp Met Gly Asn Met He Val Ala 
325 330 335 

Met He Gin Leu Phe Val Leu Tyr Ser Thr Val Lys Asn He Ser Val 
340 345 350 

Lys Asp Gly Tyr Arg Val Glu Thr Glu Leu Gly Gin Lys Arg Val Tyr 
355 360 365 



Leu Ser Tyr Ser Glu Val Arg Glu Ala He Leu Gly Gly Lys Tyr Gly 

370 375 380 

Ala Ser Pro Thr Asn Thr Val Arg Ser Phe Met Arg Tyr Phe Ala His 

385 390 395 400 

Thr Thr He Thr Leu Leu He Glu Lys Lys He Gin Pro Ala Cys Thr 

405 410 415 

Ala Leu Ala Lys His Gly Val Pro Lys Arg Phe Thr Pro Tyr Cys Phe 



420 



425 



430 



Asp Phe Ala Leu Leu Asp Asn Arg Tyr Tyr Pro Ala Asp Val Leu Lys 
435 440 445 



Ala Asn Ala Met Ala Cys Ala He Ala He Lys Ser Ala Asn Leu Arg 
450 455 460 
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Arg Lys Gly Ser Glu Thr Tyr Asn He Leu Glu Ser He 

465 470 475 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 558 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: not relevant 

<ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17: 

ATGGAATTCA GACCAGTTTT AATTACAGTT CGCCGTGATC CCGGCGTAAA CACTGGTAGT 60 

TTGAAAGTGA TAGCTTATGA CTTACACTAC GACAATATAT TCGATAACTG CGCGGTAAAG 120 

TCGTTTCGAG ACACCGACAC TGGATTCACT GTTATGAAAG AATACTCGAC GAATTCAGCG 180 

TTCATACTAA GTCCTTATAA ACTGTTTTCC GCGGTCTTTA ATAAGGAAGG TGAGATGATA 24 0 

AGTAACGATG TAGGATCGAG TTTCAGGGTT TACAATATCT TTTCGCAAAT GTG TAAAG AT 300 

ATCAACGAGA TCAGCGAGAT ACAACGCGCC GGTTACCTAG AAACATATTT AGGAGACGGG 360 

CAGGCTGACA CTGATATATT TTTTGATGTC TTAACCAACA ACAAAGCAAA GGTAAGGTGG 4 20 

TTAGTTAATA AAGACCATAG CGCGTGGTGT GGGATATTGA ATGATTTGAA GTGGGAAGAG 4 80 

AGCAACAAGG AGAAATTTAA GGGGAGAGAC ATACTAGATA CTTACGTTTT ATCGTCTGAT 54 0 
TATCCAGGGT TTAAATGA 
(2) INFORMATION FOR SEQ ID NO:18: 
<i) SEQUENCE CHARACTERISTICS: 



558 
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(A) LENGTH: 185 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:18: 

Met Glu Phe Arg Pro Val Leu lie Thr Val Arg Arg Asp Pro Gly Val 
15 10 15 

Asn Thr Gly Ser Leu Lys Val lie Ala Tyr Asp Leu His Tyr Asp Asn 
20 25 30 

lie Phe Asp Asn Cys Ala Val Lys Ser Phe Arg Asp Thr Asp Thr Gly 
35 40 45 

Phe Thr Val Met Lys Glu Tyr Ser Thr Asn Ser Ala Phe lie Leu Ser 
50 55 60 

Pro Tyr Lys Leu Phe Ser Ala Val Phe Asn Lys Glu Gly Glu Met He 
65 70 75 80 

Ser Asn Asp Val Gly Ser Ser Phe Arg Val Tyr Asn He Phe Ser Gin 
85 90 95 

Met Cys Lys Asp He Asn Glu He Ser Glu He Gin Arg Ala Gly Tyr 
100 105 HO 

Leu Glu Thr Tyr Leu Gly Asp Gly Gin Ala Asp Thr Asp He Phe Phe 
115 120 125 

Asp Val Leu Thr Asn Asn Lys Ala Lys Val Arg Trp Leu Val Asn Lys 
130 135 140 

Asp His Ser Ala Trp Cys Gly He Leu Asn Asp Leu Lys Trp Glu Glu 
145 150 155 160 
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Ser Asn Lys Glu Lys Phe Lya Gly Arg Asp He Leu Asp Thr Tyr Val 



165 



170 1" 



Leu Ser Ser Asp Tyr Pro Gly Phe Lys 
5 180 185 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 
10 (a) LENGTH: 534 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: not relevant 

15 (ii ) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
" ATGAAGTTGC TTTCGCTCCG CTATCTTATC TTAAGGTTGT CAAAGTCGCT TAGAACGAAC 

GATCACTTGG TTTTAATACT TATAAAGGAG GCGCTTATAA ACTATTACAA CGCCTCTTTC 
30 ACCGATGAGG GTGCCGTATT AAGAGACTCT CGCGAAAGTA TAGAGAATTT TCTCGTAGCC 

AGGTGCGGTT CGCAAAATTC CTGCCGAGTC ATGAAGGCTT TGATCACTAA CACAGTCTGT 
AAGATGTCGA TAGAAACAGC CAGAAGTTTT ATCGGAGACT TAATACTCGT CGCCGACTCC 
35 TCTGTTTCAG CGTTGGAAGA AGCGAAATCA ATTAAAGATA ATTTCCGCTT AAGAAAAAGG 

AGAGGCAAGT ATTATTATAG TGGTGATTGT GGATCCGACG TTGCGAAAGT TAAGTATATT 
4 0 TTGTCTGGGG AGAATCGAGG ATTGGGGTGC GTAGATTCCT TGAAGCTAGT TTGCGTAGGT 

AGACAAGGAG GTGGAAACGT ACTACAGCAC CTACTAATCT CATCTCTGGG TTAA 
{2} INFORMATION FOR SEQ ID NO: 20: 



120 
180 
240 
300 
360 
420 
480 
534 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 177 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

<ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Met Lys Leu Leu Ser Leu Arg Tyr Leu lie Leu Arg Leu Ser Lys Ser 
15 10 15 

Leu Arg Thr Asn Asp His Leu Val Leu He Leu He Lys Glu Ala Leu 
20 25 30 

He Asn Tyr Tyr Asn Ala Ser Phe Thr Asp Glu Gly Ala Val Leu Arg 
35 40 45 

Asp Ser Arg Glu Ser He Glu Asn Phe Leu Val Ala Arg Cys Gly Ser 
50 55 60 

Gin Asn Ser Cys Arg Val Met Lys Ala Leu He Thr Asn Thr Val Cys 
65 70 75 80 

Lys Met Ser He Glu Thr Ala Arg Ser Phe He Gly Asp Leu He Leu 
85 90 95 

Val Ala Asp Ser Ser Val Ser Ala Leu Glu Glu Ala Lys Ser He Lys 
100 105 HO 

Asp Asn Phe Arg Leu Arg Lys Arg Arg Gly Lys Tyr Tyr Tyr Ser Gly 
115 120 125 

Asp Cys Gly Ser Asp Val Ala Lys Val Lys Tyr lie Leu Ser Gly Glu 
130 135 140 

Asn Arg Gly Leu Gly Cys Val Asp Ser Leu Lys Leu Val Cys Val Gly 
145 150 155 160 
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Arg Gin Gly Gly Gly Asn Val Leu Gin His Leu Leu He Ser Ser Leu 
165 1™ 175 



Gly 

5 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 54 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: not relevant 

15 (ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 

20 



25 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO:21: 

ATGGACCTAT CGTTTATTAT TGTGCAGATC CTTTCCGCCT CGTACAATAA TGACGTGACA 60 

GCACTTTACA CTTTGATTAA CGCGTATAAT AGCGTTGATG ATACGACGCG CTGGGCAGCG 120 

30 ATAAACGATC CGCAAGCTGA GGTTAACGTC GTGAAGGCTT ACGTAGCTAC TACAGCGACG 18 0 

ACTGAGCTGC ATAGAACAAT TCTCATTGAC AGTATAGACT CCGCCTTCGC TTATGACCAA 240 

GTGGGGTGTT TGGTGGGCAT AGCTAGAGGT TTGCTTAGAC ATTCGGAAGA TGTTCTGGAG 300 

35 

GTCATCAAGT CGATGGAGTT ATTCGAAGTG TGTCGTGGAA AGAGGGGAAG CAAAAGATAT 360 

CTTGGATACT TAAGTGATCA ATGCACTAAC AAATACATGA TGCTAACTCA GGCCGGACTG 42 0 

4 0 GCCGCAGTTG AAGGAGCAGA CATACTACGA ACGAATCATC TAGTCAGTGG TAATAAGTTC 4 80 

TCTCCAAATT TCGGGATCGC TAGGATGTTG CTCTTGACGC TTTGTTGCGG AGCACTATAA 54 0 

4 5 (2) INFORMATION FOR SEQ ID NO: 22: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 179 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

Met Asp Leu Ser Phe He He Val Gin He Leu Ser Ala Ser Tyr Asn 
15 10 15 

Asn Asp Val Thr Ala Leu Tyr Thr Leu He Asn Ala Tyr Asn Ser Val 
20 25 30 

Asp Asp Thr Thr Arg Trp Ala Ala He Asn Asp Pro Gin Ala Glu Val 
35 40 45 

Asn Val Val Lys Ala Tyr Val Ala Thr Thr Ala Thr Thr Glu Leu His 
50 55 60 

Arg Thr He Leu He Asp Ser He Asp Ser Ala Phe Ala Tyr Asp Gin 
65 70 75 80 

Val Gly Cys Leu Val Gly He Ala Arg Gly Leu Leu Arg His Ser Glu 
85 90 95 

Asp Val Leu Glu Val He Lys Ser Met Glu Leu Phe Glu Val Cys Arg 
100 105 HO 

Gly Lys Arg Gly Ser Lys Arg Tyr Leu Gly Tyr Leu Ser Asp Gin Cys 
115 120 125 

Thr Asn Lys Tyr Met Met Leu Thr Gin Ala Gly Leu Ala Ala Val Glu 
130 135 140 

Gly Ala Asp He Leu Arg Thr Asn His Leu Val Ser Gly Asn Lys Phe 
145 150 155 160 
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Ser Pro Asn Phe Gly He Ala Arg Met Leu Leu Leu Thr Leu Cys Cys 
165 "0 175 

Gly Ala Leu 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 183 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
ATGAGGCACT TAGAAAAACC CATCAGAGTA GCGGTACACT ATTGCGTCGT GCGAAGTGAC 
GTTTGTGACG GGTGGGATGT ATTTATAGGC GTAACGTTAA TCGGTATGTT TATTAGTTAC 
TATTTATATG CTCTAATTAG CATATGTAGA AAAGGAGAAG GTTTAACAAC CAGTAATGGG 
TAA 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 



(iii) HYPOTHETICAL: YES 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 

Met Arg His Leu Glu Lys Pro He Arg Val Ala Val His Tyr Cys Val 
15 10 IS 

Val Arg Ser Asp Val Cys Asp Gly Trp Asp Val Phe He Gly Val Thr 
20 25 30 

Leu He Gly Met Phe He Ser Tyr Tyr Leu Tyr Ala Leu He Ser He 
35 40 45 

Cys Arg Lys Gly Glu Gly Leu Thr Thr Ser Asn Gly 
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligonucleotide" 

(iii) HYPOTHETICAL: NO 



(ix) FEATURE: 

(A) NAME /KEY : misc_feature 

(B) LOCATION: 1..24 

(D) OTHER INFORMATION: /product= "Oligonucleotide" 
/note= "N is inosine at sites in this sequence." 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 

GGNGGNGGNA CNTTYGAYGT NTCN 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 15 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligonucleotide." 

(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 
UGAGUGAACG CGAUG 

(2) INFORMATION FOR SEQ ID NO:27: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligonucleotide." 

(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 
ATAAGCATTC GGGATGGACC 
(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : other nucleic acid 
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(A) DESCRIPTION: /desc = "Oligonucleotide." 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 
ATTAACTTGA CGGATGGCAC GC 
(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligonucleotide . 

(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 
TACTTATCTA GAACCATGGA AGCGAGTCGA CGACTA 
(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligonucleotide." 



(iii) HYPOTHETICAL: NO 
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5 



10 



15 



20 



25 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:30: 
TCTTGAGGAT CCATGGAGAA ACATCGTCGC ATACTA 
(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligonucleotide." 

(iii) HYPOTHETICAL: NO 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:31:, 
ACTATTTCTA GAACCATGGC ATTTGAACTG AAATT 
(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 
35 ( A ) DESCRIPTION: /desc = "Oligonucleotide." 

(iii) HYPOTHETICAL: NO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 
TTCTGAGGAT CCATGGTATA AGCTCCCATG AATTAT 



36 



35 



36 



45 
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TABLE 1 



Particle Coat protein Mr Reference 

length (X10 3 ) 

fnm) 



1,400-2,200 
l f 400-1, 800 

1,400-2,200 
1,400-2,200 
1,400-2,200 
1,400-2,200 



39 


Gugerli 




(1984) 


26 






Gugerli 




(1984) 


43 


Zinimermann 




(1990) 


36 






Zee (1987) 


36 






Hu (1990) 


36 






Zimmermann 




(1990) 




Gugerli 




(1993) 
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TABLE 2 

Nucleotide and Deduced Amino Acid Sequences for GLRaV-3 
Coat Protein. 

ATGGCAT7TGAACTGAAATTACCGCAGATATATGAA.GTCSTCCCCCAAAATAATTTCAG« 

1 - — 60 

MArSLKLCQlTfSVVJEHMLR - 

GTT AG ACTA GCCG ATGC GCCACAAGG AAAATTTAGTAAGGCGAGTTTCTTAAAGTACGTT 

€1 - 120 

VRVCD AAQGKFSKASFL KYV - 



AAGGACCGCACACAGGCGGAATTAACGGGAATCC^ 

121 « - * "0 

KDGTGAE. LTGIAVVPEKYVF - 

GCCACACCAGCTTTGGCTACAGCGGCGCAGGACCCACCTAGGCA 

181 . ► * + 240 

ATAALATAAQEPPRQPP>QV - 



GTGGAACCACAGGAAACCGATATAGCGCTAGTGCCGGAATCTGACACTCTCACACCAAAT 

_ ^ — ♦ 300 

241 v"e"p""q" etdigvvpesetltpn - 

AACTTGGTTTTCGACAJVACATCCAGACAAGTTCTTGAAGACTA 

_ 4- — — * 3 60 

301 * * 

KLVFEKDPDKFLKTMCKGIA 

TTGGACT^GACGGGAGTTACCCACAAACCGAAAGTTATTAACGAGCCAGCGAAAGTATCA 

_ ^ + * * * 420 

3 rVrrT VTHKP KVINEPCKVS - 

CTAGAGGTCCCAATG^GATTAATGCCGCAT^ 

^ _ + ♦ — - — - — -~ —™«. 4fl0 

C'TTTTTT'n AALMELCKKVHG - 

GCCG ATC ACGCACCAACTAAG ACAAAA l ' I L . 1 C : rS T A CGTCATGCAGATTGCTIgCACC 

«81 * * 5 *° 

ADOAATKTKFFLYVMQIACT - 

TTCTITACATCGTCTTCGACGGAGTrCAAACACTTrCACTACATAGAAACCCATCATGGA 

„____„-— — *- — -——»-- — ----» 600 

s " p"f"t"s"s"s t e f xefdyietdoc 

AAGAAGATATATCCGCTCTX^ATATOATTC^^ ^ 

601 — * — — *— 1 "* 

KKIYAVWVYDCIKQAAASTG - 

TATGAAAACCCGCTAAGGCAGTATCTA^CGTACTTCACACCAACCTTCAT 

, — — — ♦ 720 

661 * r 



YENPVRQY 



LAYFTP-TFITAT - 



CTGAATGCTAAAC^ACrrt^TC^^ ^ 

_ t i . . - — — — — — ~ — — 780 

TTCTTTCCGTACACGATAGA CTGCCTTCCTCCGACCTACGATCICTTCAACAACGACCCA 

______ x , ■ t ■ — — * 840 

781 f""f""p"T"t"i"'d "cvrptyolfnn da^ - 

ATATTAGC ATGG AATTTAG ^^AGACAGCAGGCG^tTTACAAACAAGACGGTAACGGCCC AT 

_ * * — - * 900 

4 :"l~T~w"n"l~". rqoafRNKtvtad - 

. AAC^CCTTACACAACCTCT^CCAACTArrCCA^ ^ 
901 
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TABLE 3 

Comparison of Coat Protein Sequences of GLRaV-3 with BTV, 
CTV" and LIYV. 



3YV_C? - 

CTV_C? - 

LIYV.C? 

GUlaV3_CP KAFELXLCQI YEWPENNLR VRVGDAAQGK FSKASFLKYV KDGTQAELTG 

CONSENSUS : — 



51 100 

BYV_CP • 

CTV_CP KDDET3GCLKN 

LXYVJCP - -MDTDCD NEVTGSGNDT RNNDDKKKEE MKQNI SDN SQ 

CLRaV3_C? IAWPEKYVF ATAALATAAQ EPPHQPPAQV VEPQETDMCV VPESETLTPN 

CONSENSUS 



101 

BYV_C? MGSAE PISAIA. -TF 

CTV__C? KNKETKEGDO WAAES - - SF 
LIYV_C? IISTRDKEAD IIGSISTCEDL 
CLRaV3_CP KLVFEXDPDK FLKTMCKGIA 

CONSENSUS <i -i f 



150 

ENVSI..AO.Q TCLKGZDCDK LRK N 

GSVNLHID.P TLITMNDVRQ LSTQQNAALN 
SKXWRVDRH DALSANDVQS FR. ..EAMIN 
LDLTCVTHKP KVI..NEPGK VSVEVAMXIN 

--v d— nd- — 1 N 



151 

BYV_C? FEZCLXLXC . 
CTvIc? RDLFLALKG X 
LIYV_CP - - . FMROKDP 
CLRaV3_CF .AALMELCKK 

CONSENSUS C — lk« 



. . .VPEDNLG IALCLCLYSC 
YPNLPDKDKD FH IAMML YRL 
NRNQPSDKLX IAKEVGVYQH 
VMGA DDAATX TKFFLYVMQI 

pd 1— y— 



200 

AT . IGTSMKV NVQPTS.TFI 
AV.KSSSLQS DDDTTGITYT 
VT2JLGTSAKL C-NANNLEFT 
ACTFFTS . .S STEFKEFDYI 

a CS 



201 



250 



BYV CP XASFCGCKEL YLTHCELNSF LCSQKLLECX PNKLRCFCRT FQKDYISLRK 

CTV^C? R ECVEV DLSDKLWTDI VYNSKGXGNR TNALRVWGRT NDALYLAFCR 

Liyv CP IAYDQETRTY KVAO..FVNY MQSR. .MRNS PNWRQYARA MEXTTNNIRS 

CLRaV^IcP ET..DDCXKI Y.-AVWVYDC IKQAAASTCY ENPVRQYLAY FTPTFITATL 



CONSENSUS 



251 3°° 
BYV_CP EYRCZXPPIA RANRHCLPAE DHYXAADF.I STSTELTDLQ QSRLLLAREN 
CTV~CP QNR.NLSYGG RPLDAGIPAG YHYLCADP.L .TCACLTDLE CAVYTQAKEQ 
LIYV_C? AGIIH.SHCV LAAKHGVLAS YRNSYSDPAV CFGNDTTDAQ LTSLMLARKQ 
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TABLE 4 

Partial GLRaV-3 Nucleotide Sequence and Encoded Proteins, 



VSTYAKSVMNDNFNI LiETIjv 
ACTTTQCCCAJ^CTrTATAGTCAAACT ^ 

TLPKSFIVKVPGSVLVSITi 
TCGGGCATTTCCGAC^CTTGAACTTCG^CG^^^ ^ 

121 agcccg^c^^^ _ 

SGISDKjE LRGAFDVSKKNf 

tccaggaggttacgttcgagtcgtttgcgcgtattctctagggct 

SRRLRSSRLRVFSRAlve.o 

atcaaggttatgaagggcatgaaatcagaggatggtaaaccactc^ 

241 ^C^AC^CC^AC^^ . 
I. KVMKGMKSEDGKPLPIAEQ 

tccgtgtacgcgttcatgacaggcaatatgtcaaacgttcattcca 
301 a^caca^gcaag^^ 

SVYAFMTGNMSNVHCTRAUi- 



TCACGCGCT 

— + 420 



ctco3gggctcaaa«*ttcx:gcck^ 

^KCCCCGAG^CCG^COCGCCG^ . 
LGGSKACAASLAVKGAASRA 

ACTGGAACAAAACTCTTTTCAGGTCTCACATCCTTTCTTTCCGCCGGTGGTCTCTTCTAC 
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AGGGTGCTTATTGGTGTTTTTACTCTTTCCATGGCTCTCACTCCGGTGGTCTGGAAGTAC 

6S1 + + * + + 720 

TCCC ACGAAT AACCACAAAAATGAGAAAGGTACCGAGAGTGAGGC CACC AG AC CTTCATG 
RVLIGVFTLSMALTPVVWXY 

AGAA GGAATATCGCGCGAACTGGCGTGGATGTTTTCCACCGTGCTCGTTCGGGTACCGCG 

721 + + + + +- + 780 

TCTTCCTTATAGCGCGCTTGACCGCACCTACAAAAGGTGGCACGAGCAAGCCCATGGCGC 
RRNIARTGVDVFHRARSGTA 

GCCATCGGTTTACAATGTCTTAGTGGAGGAAGGTCGTTAGCTGGTGACGCTGCTCGTGGC 

7gi + * + + + 840 

CGGTAGCCAAATGTTACAGAATCACCTCCTTCCAGCAATCGACCACTGCGACGAGCACCG 
A IGL QC L S G G R S LAGDAA RG 

GCGTTAACAGTGACTCGAGGAGGGCTATCTTCGGCGGTTGCGGTGACCAGAAATACAGTG 

841 + + + + + 900 

CGCAATTGTCACTGAGCTCCTCCCGATAGAAGCCGCCAACGCCACTGGTCTTTATGTCAC 
ALTVTRGGLS SAVAVTRNTV 

GCTAGGCGTCAGGTACCATTGGCGTTGCTTTCGTTTTCCACGTCTTACGCAGTCAGTGGT 

901 + + + + + + 960 

CGATC CGCAGTC CATGGTAAC CGCAACGAAAGC AAAAGGTGCAGAATGCGTCAGTC ACCA 
ARRQVPLALLSFSTSYAVSG - 

TGCACTTTGTTAGGTATTTGGGCTCATGCTCTCCCTAGGCATTTGATGTTCTTCTTTGGC 

961 + + + + + + 1020 

ACGTGAAACAATCCATAAACCCGAGTACGAGAGGGATCCGTAAACTACAAGAAGAAACCG 
CTLLGIWAKALPRHLMFFFG 

CTAGGGACGCTCTTCGGGGTGAGTGCCAGTACCAATTCTTGGTCGCTTGGGGGCTATAC6 

10 21 + + + + + + 1080 

GATCCCTGCGAGAAGCCCCACTCACGGTCATGGTTAAGAACCAGCGAACCCCCGATATGC 

LGTLFGVSASTNSWSLGGYT 

AACAGTCTGTTCACCGTACCGGAATTAACTTGGGAAGGGAGGAGTTACAGATCTTTATTG 

1081 + + * + + + 1140 

TTGTCAGACAAGTGGCATGGCCTTAATTGAACCCTTCCCTCCTCAATGTCTAGAAATAAC 
'NSLFTVPELTWEGRSY RSLL 

CCCCAAGCAGCTTTA 

114 1 + + + + ♦ + 1200 

GGGGTTCGTCGAAATCCATAAAGAGAGCAACACGCGCCCAACAATTCACTTTGACACGGT 
P QAALG I S LVVRG LL S E T V P i - 

CAACTAACGTACGTAC CGC CGATTGAAGGTCGGAATGTTTATGATCAGGCACTAAATTTT 

1201 + * + + + + 1260 

GTTGATTGCATGCATGGCGGCTAACTTCCAGCCTTACAAATACTAGTCCGTGATTTAAAA 

Q LTYVP P IEGRNVYDQALNF 
TATCGCGACTTTGACTATGACGATGGTGCAGGCCCATCCGGGACGGCTGGTCAAAGCGAT 

1261 + * * + * + 1320 

ATAGCGCTGAAACTGATACTGCTACCACGTCCGGGTAGGCCCTGCCGACCAGriTCGCTA 

Y R D FDYDDGAGPSGTAGQ S.D 
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CCTGGAACCAATACTTCGGATACTTCTTCGGTTTTCTCTGACGATGGTTTGCCCGCTAGT 

1321 * 4.-— + + + 1380 

GGACCTTGGTTATGAAGCCTATGAAGAAGCCAAAAGAGACTGCTACCAAACGGGCGATCA 
FGTNTSDT 3SVFSDDGLPAS 

GGCGGTGGCTTCGACGCGCGCGTTGAGGCAGGTCCCAGCCATGCTGTTGATGAATCACCA 
1381 -r + * + + + 1440 

CCGCCACCGAAGCTGCGCGCGCAACTCCGTCCAGGGTCGGTACGACAACTACTTAGTGGT 
GGGFDARVEAGPSHAVDESP 

AGGGGTAGTGTTGAGTTCGTC TACAGAGAACGTGTAGATGAACATC CGGCGTGTGGTGAA 
1441 + + + + + + 1S0Q 

TCCCCATCACAACTCAAGCAGATGTCTCTTGCACATCTACTTGTAGGCCGCACACCACTT 
RGSVEFVYRERVDEHPACGE 

GCTGAAGTTGAAAAGGATCTAATAACACCACTTGGTACAGCTGTCTTAGAGTCGCCCCCC 
1501 + + + + + + 1 560 

CGACTTCAACTTTTCCTAGATTATTGTGGTGAACCATGTCGACAGAATCTCAGCGGGGGG 
AEVEKDLI TPLGTAVLESPP 

GTAGGTCCTGAAGCTGGGAGCGCGCCCAACGTCGAGGACGGTTGTCCGGAGGTTGAAGCT 
1561 + + + + + + lg20 

CATCCAGGACTTCGACCCTCGCGCGGGTTGCAGCTCCTGCCAACAGGCCTCCAACTTCGA 
VGPEAGSAPNVEDGCPE VEA 

GAGAAATGTTCGGAGGTCATCGTTGACGTTCCTAGTTCAGAACCGCCGGTACAAGAAGTC 

1621 + + + + + +' 1680 

CTCTTTACAAGCCTCCAGTAGCAACTGCAAGGATCAAGTCTTGGCGGCCATGTTCTTCAG 
EKCSEVIVDVPSSEPPVQEV 

CTTGAATCAACCAATGGTGTCCAAGCTGCAAGAACTGAAGAGGTTGTGCAGGGCGACACA 

1681 + + + + + + 1740 

GAACTTAGTTGGTTACCACAGGTTCGACGTTCTTGACTTCTCCAACACGTCCCGCTGTGT 
LESTNGVQAARTEEVVQGDT 

TGTGGAGCTGGGGTAGCTAAATCAGAAGTGAGTCAACGTGTGTTTCCTGCGCAAGTACCC 
1741 + + + «. 4- + 1800 

ACACCTCGACCCCATCGATTTAGTCTTCACTCAGTTGCACACAAAGGACGCGTTCATGGG' 
CGAG VAKS EVSQRVFPAQVP 

GCACATGAAGCTGGTCTTGAGGCATCTAGTGGCGCGGTCGTGGAGCCATTGCAAGTTTCT 
1801 + + + + + + 18 6o 

CGTGTACTTCGACCAGAACTCCGTAGATCACCGCGCCAGCACCTCGGTAACGTTCAAAGA 
AHEAGLEASSGAVVEPLQVS 

GTGCCAGTAGCCGTAGAGAAAACTGTTTTATCTGTCGAGAAGGCGCGTGAGCTAAAGGCG 

1861 + + + + + + 1920 

CACGGTCATCGGCATCTCTTTTGAGAAAATAGACAGCTC 

VPVAV EKTVLSVEKARELKA 

GTAGATAAGGGCAAGGCGGTCGTGCACGCAAAGGAAGTCAAGAATGTACCGGTTAAGACG 
1921 + + + + + + iggO 

CATCTATTCCCGTTCCGCCAGCACGTGCGTTTCCTTGAGTTCTTA 

VDKGKAVVHAKEVKNVPVKT 
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TTACCACGAGGGGCTCTAAAAATTAGTGAGGATACCGTTCGTAAGGAATTGTGCATGTTT 

19 gX + + + + 2040 

AATGGTGCTCCCCGAGATTTTTAATCACTCCTATGGCAAGCATTCCTTAACACGTACAAA 
LPRGALK I SEDTVRKELCMF 

AGAACGTGTTCCTGCGGCG7GCAGTTGGACGTGTACAATGAAGCGACCATCGCCACTA.GG 

2041 + + + * + 2100 

TCTTGCACAAGGACGCCGCACGTCAACCTGCACATGTTACTTCGCTGGTAGCGGTGATCC 
RTCSCGVQLDVYNEATIATR 

TTCTCAAATGCGTTTACCTTTGTCGATAGCTTGAAAGGGAGGAGTGCGGTCTTTTTCTCA 

2101 + + + + + 2160 

AAGAGTTTACGCAAATGGAAACAGCTATCGAACTTTCCCTCCTCACGCCAGAAAAAGAGT 
FSNAFTFVDSLKGRSAVFFS 

AAGCTGGGTGAGGGGTATACCTATAATGGTGGTAGCCATGTTTCATCAGGGTGGCCTCGT 

2161 + + + 2220 

TTCGACCCACTCCCCATATGGATATTACCACCATCGGTACAAAGTAGTCCCACCGGAGCA 
KLGEGYTYNGGSKVSSGWPR 

GCCCTAGAGGATATCTTAACGGCAATTAAGTACCCAAGCGTCTTCGACCACTGTTTAGTG 

2221 ——————— 4.— — -.———— —_+——— __—+—— — — — — — +_——— — — +■ 2230 

CGGGATCTCCTATAGAATTGCCGTTAATTCATGGGTTCGCAGAAGCTGGTGACAAATCAC 
ALEDILTAIKYPSVFDHCLV 

CAGAAGTACAAGATGGGTGGAGGCGTACCATTCCACGCTGATGACGAGGAGTGCTATCCA 

2281 + + + + + + 2340 

GTCTTCATGTTCTACCCACCTCCGCATGGTAAGGTGCGACTACTGCTCCTCACGATAGGT 
QKYKMGGGVPFHADDEECYP 

TCAGATAACCCTATCTTGACGGTCAATCTCGTGGGGAAGGCAAACTTCTCGACTAAGTGC 

2341 * + ♦ + * + 2400 

AGTCTATTGGGATAGAACTGCCAGTTAGAGCACCCCTTCCGTTTGAAGAGCTGATTCACG 

SD-NPILTVMLVGKANFSTKC 

AGGAAGGGTGGTAAGGTCATGGTCATAAACGTAGCTTCGGGTGACTATTTTCTTATGCCT 

2401 + + * + * + 2460 

TCCTTCCCACCATTCCAGTACCAGTATTTGCATCGAAGCCCACTGATAAAAGAATACGGA 

RKGGKVMVINVASGDYFLMP 

TGCGGTTTTCAAAGGACGCACTTGCATTCAGTAAACTCCATCGACGAAGGGCGCATCAGT 

2461 + + * + + 2520 

ACGCCAAAAGTTTCCTGCGTGAACGTAAGTCATTTGAGGTAGCTGCTTCCCGCGTAGTCA 
CGFQRTHLHSVNSIDEGRIS 

TTGACGTTCAGGGCAACTCGGCGCGTCTTTGGTGTAGGCAGGATGTTGCAGTTAGCCGGC 
2521 + + + 2580 



AACTGCAAGTCCCGTTGAGCCGCGCAGAAACCACATCCGTCCTACAACGTCAATCGGCCG 
LTFRA TRRVFGVGRM LQLAG 

GGCGTGTCGGATGAGAAGTCACCAGGTGTTCCAAACCAGCAACCACAGAGCCAAGGTGCT 



2581 + + + * 2640 

CCGCACAGCCTACTCTTCAGTGGTCCACAAGGTTTGGTCGTTGGTGTCTCGGTTCCACGA 

G VSDEKSPGVPNQQPQSQGA 
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ACCAGAACAATCACACCAAAATCGGGGGGCAAGGCTCTATCTGAGGGAAGTGGTAGGGAA 

2641 TGGTCT"GTTAGTGTOTTTTTAGCCCCCCGTTCCGAGATAGACTCC^ 

TRTITPKSGGKALSEC-SGRE - 

GTCAAGGGGAGGTCGACATACTCGATATGGTGCGAACAAGATTACGTTAGGAAGTGTGAG 

_ _ — _ a. — _ _ — ■— — + 2/60 

2701 CAGTTCCCCTCCATCTGTATGA^ 

VKGRSTYSIWCEQDYVRKCE - 

TGGCTCAGGGCTGATAATCCAGTGATGGCTCTTRAACCTGGCTACACCCCAATGACATTT 

__.._.. — — ——+-'«———— ————+— — — — + 2o2U 

2761 ACCGAGTCCCGACTACTAGGTCACTACCGAGAAYTTGGACCGAT 

WLRADNPVMAL? PGYTPMTF - 

GAAGTGGTTAAAGCCGGGACCTCTGAAGATGCCGTCGTGGAGTACTTGAAGTATCTGGCT 

, — — — — — —-f 4- ZobU 

2821 — — — ——+————————— +— — — — — — — — — -r — — — — T 

CTTCACCAATTTCGGCCCTGGAGACTTCTACGGCAGCA.CCTCATGAACTTCATAGACCGA 
EVVKAGTSEDAVVEYLKYLA - 

ATAGGCATTGGGAGGACATACAGGGCGTTGCTTATGGCTAGAAATACTGCCGTCAC^ ^ 

npjii — — -f — — — — — — ^ — — — • — — — — — 

TATCCGTAACCCTCCTGTATGTCCCGCAACGAATACCGATCTTTATAACGGCAGTGATGG 
IGIGRTYRALLMARNIAVTT - 

GCCGAAGGTGTTCTGWUOT^ ^ 

noAl _ — _ — __-__+————— ————+«- — ———————+———— — — — — — -r — — 

CGGCTTCCACAAGACTTTCATGGATTAGTTCAAATACTTAGTGATGGCCCGAAAGTGCAA 
AEGVLKVPNQVYESLPGFHV - 

TACAAGTCGGGCACAGATCTCATTTTTCATTCAACACAAGACGGC 3 ^ 

3001 atcctcIgcccgtctctaSgta^gtaagttgtgw 

YKSGTDLIFHSTQDGLRVRD - 
CTACCGTACGTATTCATAGCTGAGAAAGGTATTT^ 3i2Q 

3061 GATGGCATG^TAAGTATCGACTC^TCCATAA^ 

LPYVFIAEKGIFIKGKDVDA - 

GTAGTAGCTTTGGGCGACAATCTGTCCGTATGTGATGATATATTGGTTCTCCATGAT ^ 

3121 ———————— f———— ————•+• — — + ~ 

CATCATCGAAACCCGCTGTTAGACAGGCATACACTACTATATAACCAAAAGGTACTACGA 

VVAL6DHLSVCDDILVFHDA - 

ATTAATTTGATGGGTGCACTGAAAGTTGCTCGATGTGGTATGGTGG 

3191 ___—„—.—+--———-——+"-—— ——-•+— — —————— - 

TAATTAAACTACCCACGTGACTTTC^ACGAGCTACACCATACCACCCACTTAGTAAATTC 
I NLMG ALKVARCGMVGESFK - 

TCGTTCGAATACAAATGCTATAATGCTCCCCCAGGTGGCKTAAGAC ^ 

3241 AGCA^Kra^A^GA^AC^ 

SFEYKCYNAPPGGG KTTMLV - 
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GACG AAT^GTC AAGTC ACCCAATAGCAC GGC CACCATTACGGCTAACGTGGGAAGTTCT 
3301 1.. * + - * 3360 

ctgcttaaacagttcagtgggttatcgtgccggtggtaatgccgattgcacccttcaaga 
d'e'fvxspmstatitanvgss 

GAGGACATAAATATGGCGGTGAAGAAGAGAGATCCGAATTTGGAAGGTCTCAACAGTGCT . 

3361 . + * +— - - 3420 

CTCCTGTATTTATACCGCCACTTCTTCTCTCTAGGCTTAAACCTTCCAGAGTTGTCACGA 
E*D*INMAVKKRDPNLEGLNSA 

ACCACAGTTAACTCCAGGGTGGTTAACTTTATTGTCAGGGGAATGTATAAAAGGGTTTTG 

3421 _ + + * * + 3480 

TGGTGTCAATTGAGGTCC CAC CAATTGAAATAACAGTCCC CTTACATATTTTCCCAAAAC 
TTVNS RVVNF IVRGMYKRVL 

GTGGATGAGGTGTACATGATGCATCAAGGCTTACTACAACTAGGCGTCTTCGCAACCGGC 

34gl + + + + +- : + .3540 

CACCTACTCCACATGTACTACGTAGTTCCGAATGATGTTGATCCGCAGAAGCGrtGGC.es 
VDEVYMMHQGLLQLGVFATG - 

GCGTCGGAAGGCCTCTTTTTTGGAGACATAAATCAGATACCATTCATAAACMGGGAGAAG 

3541 + + + * + + 3600 

CGCAGCC^TCCGGAGAAAAAACCTCTGTATTTAGTCTATGGTAAGTATTTGKCCCTCTTC 

ASEGLFFGDINQIPFINREK - 

GTGT^TAGGATGGATTGTGCTGTATTTGTTCCAAAGAAGGAAAGCGTTGTATACACTTCT 

3601 + + + + + 3660 

CACAAATCCTACCTAACACGACATAAACAAGGTTTCTTCCTTTCGCAACATATGTGAAGA 
VFRMDCAVFVPKKESVVYTS 

AAATCATACAGGTGTCCGTTAGATGTTTGCTACTTGTTGTCCTCAATGACCGTAAGGGGA 

3661 + + * + + + 3720 

^TTAGTATGTCCACAGGCAATCTACAAACGATGAACAACAGGAGTTACTGGCATTCCCCT 

KSYRCPLDVCYLLSSMTVRG 

ACGGAAAAGTGTTACCCTGAAAAGGTCGTTAGCGGTAAGGACAAACCAGTAGTAAGATCG 

3721 + + + + + + 3780 

TGCCTTTTCACAATGGGACTTTTCCAGCAATCGCCATTCCTGTTTGGTCATCATTCTAGC 

TEKCYPEKVVSGKDKPVVRS 

C^GTCCAAAAGGCCAATTGGAACCACTGATGACGTAGCTGAAATAAACGCTGACGTGTAC 

3781 _1 + + + + + 3840 

GACAGGTTTTCCGGTTAACCTTGGTGACTACTGCATCGACTTTATTTGCGACTGCACATG 
LSKRPIGTTDDVAE.INADVY - 

TTGTGCATGACCCAGTTGGAGAAGTCGGATATGAAGAGGTCGTTGAAGG 
. + + + + + + 3900 

II^CGTACT^T^CC 

LCMTQLEKSDMKRSLKGKGK - 

GAAACACCAGTGATGACAGTGCATGAAGCACAGGGAAAAACATTCAGTGATGTGGTAT^ 

3 g 0 l + + + + - * * 3960 

CTTTGTGGTCACTACTGTCACGTACrtCGTGTCCCTTTTTGTAAGTCACTACACCATAAC 

ETPVMTVHEAQGKTFSDVVL - 
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a 



T^TAGGACGAAGAAAGCCGATGACTCCCTATTCACTAAACAACCGC\TATACTTGTTC«T 

3951 _; + + + + ♦ » 4020 

AAATCC^GC^TCTTTCGGCTACTGAGGGATAAGTGATTTGTTGGCGTATATGAACAACCA 
FRTKKADDSLFTKQPHILVG - 

-TGTCGAGACACACACGCTCACTGGTTTATGCCGCTCTGAGCTCAGAGTTGGACGATAAG 

4Q2 - + + + * + 4050 

AACAGC-CTGTGTGTGCGAGTGACCAAATACGGCGAGACTCGAGTCTCAACCTGCTATTC 
LSRHTRSLVYAALSSELDDK - 

'AQSWTIR- 

( FRAME SHI FT) 

GTCGGCACATATATTAGCGACGCGTCGCCTCAATCAGTATCCGACGCTTTGCTTSaCACG 

+ + +: + + + 4140 

CAGCCGTGTATATAATCGCTGCGCAGCGGAGTTAGTCATAGGCTGCGAAACGAAGTGTGC 
VGTYISDAS pQSVSDALLffT - 
S A H I l> A T R H h N Q Y'PTLCFVR 

ORPli (RdSp) 

TTCGCCCCGGCTGGTTGCTTTCGAGGTATATGAGCGTATGAATTTTGGACCGACCTTCGA 

+ ---+ +■ + 4200 

41 Ia^cgg^cogacc^cgLv^ 

FAPAGCFRGI 



b S P 



RLVA FEVYERMNFGPTFE- 



aggggagttggtacggaagataccaacaagtcattttgtagccgtgaatgggtttctcga 

+ h . --+ + 4260 

tcccc^caac^tgccttctatggttgttcagtaaaacatcggcacttacccaaagagct 
ge'lvrki pt shfvavngfle - 

ggacttactcgacggttgtccggctttcgactatgacttctttgaggatgatttcgaaac 

+-—- - k ■ + — + 4320 

cctg^tgagctgccaa^ggccg^ 

dlldgcpafdydffeddfet- 
ttcagatcagtctttcctcatagaagatgtgcgcatttctgaatctttttctcattttgc 

, ! ( — 4- 43 80 

4 ^GTCTAGTCAGAAAG^GTATC 

SDQSFLIEDVRISESFSHFA- 

GTCGAAAATAGAGGATAGGTTTTACAGTTTTATTAGGTCTAGCGTAGGTTTACCAAAGCG 

_ .^^...^...j, — — 4--- — — — — — + 4440 

4381 ^T^ATCTCCTATCCAAAAra 

SKIEDRFYSFIRSSVGLPKR- 

CAACACCTTGAAGTGTAACCTCGTCACGTTTGAAAATAGGAATTCCAACGCCGATCGCGG 

4441 + + + + * 4500 

GTTGTGGAACTTCACATTGGAGCAGTGCAAACTTTTATCCTTAAGGTTGCGGCTAGCGCC 

NTLKCNLVT FE NRNSNADRG 
TTGTAACGTGGGTTGTGACGACTCTG7GGCGCATGAACTGAAGGAGATTCTCTTCGAGGA ^ 

4501 AACATTGOCCCAACACTGCTGAGACACCGCGTACTTGACTTCCTC 

CNVGCDDSVAHELKEIFFEE- 
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GGTCC-TTAACAAAGCTCGTTTAGCAGAGGTGACGGA^GCCATTTGTCCAGCAACACGAT 

4561 + + * + + + 4620 

CCAGCAATTGTTTCGAGCAAATCGTCTCCACTGCCTTTCGGTAAACAGGTCGTTGTGCTA 
b VVNKARLAEVTESHLSSNTM- 

GTTGTTATCAGATTGGTTGGACAAAAGGGCACCTAACGCTTACAAGTCTCTCAAGCGGGC 

4621 + + + + + 4680 

CAACAATAGTCTAACCAACCTGTTTTCCCGTGGATTGCGAATGTTCAGAGAGTTCGCCCG 
b LLSDWLDKRAP NAYKSLKRA 

TTTAGGTTCGGTTGTCTTTCATCCGTCTATGTTGACGTCTTATACGCTCATGGTGAAAGC 

4681 + + + + + + 4740 

AAATCCAAGCCAACAGAAAGTAGGCAGATACAACTGCAGAATATGCGAGTACCACTTTCG 
b LGSVVFHP SMLTSYTLMVKA 

AGACGTAAAACCCAAGTTGGACAATACGCCATTGTCGAAGTACGTAACGGGGCAGAATAT 

4741 + + + + + + 4800 

TCTGCATTTTGGGTTCAACCTGTTATCCGGTAACAGCTTCATGCATTGCCCCGTCTTATA 
b DVKPKLDNTPLSKYVTGQNI- 

AGTCTACCACGATAGGTGCGTAACTGCGCTTTTTTCTTGCATTTTTACTGCGTGCGTAGA 

4801 + * + + + + 4860 

TC AGATGGTGCTATC CACGCATTG ACGC G AAAAAAG AACGTAAAAATGACGCACG C ATCT 
b VYHDRCVTALFSCIFTACVE- 

GCGCTTAAAATACGTAGTGGACGAAAGGTGGCTCTTCTACCACGGGATGGACACTGCGGA 

4861 + + + + + + 4920 

CGCGAATTTTATGCATCACCTGCTTTCCACCGAGAAGATGGTGCCCTACCTGTGACGCCT 
b RLKYVVD ERWLFYHGMDTAE 

GTTGGCGGCTGCATTGAGGAACAATTTGGGGGACATCCGGCAATACTACACCTATGAACT 

4921 + + ■+■ + *+■ + 4980 

CAACCGCCGACGTAACTCCTTGTTAAACCCCCTGTAGGCCGTTATGATGTGGATACTTGA 
b LAAALRNNLGDIRQYYTYE L 

GGATATCAGTAAGTACGACAAATCTCAGAGTGCTCTCATGAAGCAGGTGGAGGAGTTGAT 

4981 + + + + + ■ + 5040 

CCTATAGTCATTCATGCTGTTTAGAGTCTCACGAGAGTACTTCGTCCACCTCCTCAACTA 
b DISKYDKSQSALMKQ.VE ELI- 

ACTCTTGACACTTGGTGTTGATAGAGAAGTTTTGTCTACTTTCTTTTGTGGTGAGTATGA 

5041 + + ■ + + + + 5100 

TGAGAACTGTGAACCACAACTATCTCTTCAAAACAGATGAAAGAAAACACCACTCATACT 
b LLTLGVDREVLSTFFCGEYD- 

TAGCGTCGTGAGAACGATGACGAAGGAATTGGTGTTGTCTGTCGGCTCTCAGAGGCGCAG 

5101 + + + + + + 5160 

ATCGCAGCACTCTTGCTACTGCTTCCTTAACCACAACAGACAGCCGAGAGTCTCCGCGTC 
b SVVRTMT KELVLSVGSQRRS 

TGGTGGTGCTAACACGTGGTTGGGAAATAGTTTAGTCTTGTGCACCTTGTTGTCCGTAGT 

5161 + + + * + + 5220 

ACCACCACGATTGTGCACCAACCCTTTATCAAATCAGAACACGTGGAACAACAGGCATCA 
b GGANTWLGNS LVLCTL LSVV- 
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ACTT * GGGGATTAGATTATAGTTATATTGTAGTTAGCGGTGATGATAGCCTTATATTTAG 

2 2 ->i + - -r i- + 5280 

TGAATCCCCTAATCTAATATCAATATAACATCAATCGCCACTACTATCGGAATATAAATC 
b L.RGLDYSYIVVSGDDSLIFS- 



TCGGCAGCCGTTGGATATTGATACGTCGGTTCTGAGCGATAATTTTGGTTTTGACGTAAA 

5281 + + + * + 5340 

AGCCGTCGGCAACCTATAACTATGCAGCCAAGACTCGCTATTAAAACCAAAACTGCATTT 
b RQPLDIDTSVLSDNFGFDVK- 

GATTTTTAACCAAGCTGCTCCATATTTTTGTTCTAAGTTTTTAGTTCAAGTCGAGGATAG 

5341 + + + + 5400 

CTAAAAATTGGTTCGACGAGGTATAAAAACAAGATTCAAAAATCAAGTTCAGCTCCTATC 

b IFNQAAPYFCSKFLVQVEDS- 

TCTCTTTTTTGTTCCCGATCCACTTAAACTCTTCGTTAAGTTTGGAGCTTCCAAAACTTC 

5401 + + * + * + 5460 

AGAGAAAAAACAAGGGCTAGGTGAATTTGAGAAGCAATTCAAACCTCGAAGGTTTTGAAG 
b L FFVPDP LKLFVKFGASKTS- 

AGATATCGACCTTTTACATGAGATTTTTCAATCTTTCGTCGATCTTTCGAAGGGTTTCAA 

5461 + - + * + * 5520 

TCTATAGCTGGAAAATGTACTCTAAAAAGTTAGAAAGCAGCTAGAAAGCTTCCCAAAGTT 
b DIDLLHEIFQSFVDLSKGFN- 

TAGAGAGGACGTCATCCAGGAATTAGCTAAGCTGGTGACGCGGAAATATAAGCATTCGGG 

5521 + ■+■ * + * + 5580 

ATCTCTCCTGCAGTAGGTCCTTAATCGATTCGACCACTGCGCCTTTATATTCGTAAGCCC 
b REDVIQELAKLVTRKYKHSG- 

ATGGACCTACTCGGCTTTGTGTGTCTTGCACGTTTTAAGTGCAAATTTTTCGCAGTTCTG 

5581 + + + * + + 5640 

TACCTGGATGAGCCGAAACACACAGAACGTGCAAAATTCACGTTTAAAAAGCGTCAAGAC 
b WTYSALCVLHVLSANFSQFC- 



TAGGTTATATTACCACAATAGCGTGAATCTCGATGTGCGCCCTATTCAGAGGACCGAGTC 

5641 . + + + + + 5700 

ATCCAATATAATGGTGTTATCGCACTTAGAGCTACACGCGGGATAAGTCTCCTGGCTCAG 
b RLYYHNSVNLDVRP IQRTES 

GCTTTCCTTGCTGGCCTTGAAGGCAAGAATTTTAAGGTGGAAAGCTTCTCGTTTTGC 

5701 + * + - + + 5760 

CGAAAGGAACGACCGGAACTTCCGTTCTTAAAATTCCACCTTTCGAAGAGCAAAACGGAA 
b LSLLALKARILRWKASRFAF^ 

TTCGATAAAGAGGGGTTAATCGCGTTGGCCACGCTATAGTGTTTCTGTGCCTCGGTTCTT 

5761 + 4. + + t + 5 820 

AAGCTATTTCTCCCCAATTAGCGCAACCGGTGCGATATCACAAAGACACGGAGCCAAGAA 

b SIKRG* 



CGTGAGGTTAATACCGAAGGGTCGTCGTACTTATCTCAGTTATTTATTTTTTCGTCTTCT 
^ , _ t- i — — + 

GCACTCCAATTATGGCTTCCCAGCAGCATGAATAGAG 



CTTAGGCGTGCCATCCGTGAAGTTAATACCGGTGGCACTCCTTCTCGAAGTGGGTATTAA 

5881 + + + * + * 5940 

GAATC CGCACGGTAGGCACTTCAATTATGGCCACCGTGAGGAAGAGCTTCAC CCATAATT 
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AGACCAAAATTTTTTATTTGTGTGTACTTTTTGTTTTGTTCACACCGTGAGGACAAGACC 
5941 + + + + + + 6000 

TCTGGTTTTAAAAAATAAACACACATGAAAAACAAAACAAGTGTGGCACTCCTGTTCTGG 
ORT2 (7K) 

GGTGGAACATGTACAGTAGAGGGTCTTTCTTTAAGTCTCGGGTTACCCTTCCTACTCTTG 

6001 + 4- 4- 4- + + 6060 

C CACCTTGTACATGTCATCTCCCAGAAAG AAATTCAG AGC CCAATGGGAAGGATGAGAAC 

MYSRG SFFKSRVTLP TLV- 

TCGGAGCATACATGTGGGAGTTTGAACTCCCGTATCTTACGGACAAGAGACACATCAGCT 

6061 + + 4- 4- + + 6120 

AGCCTCGTATGTACACCCTCAAACTTGAGGGCATAGAATGCCTGTTCTCTGTGTAGTCGA 
GAYMWEFELPYLTDKRHISY- 

ATAGCGCGCCAAGTGTCGCGACTTTTAGCCTTGTGTCGAGGTAGGATAGGGGCCAACAGG 

6121 + + + + + ■+- 6180 

TATCGCGCGGTTCACAGCGCTGAAAATCGGAACACAGCTCCATCCTATCCCCGGTTGTCC 
SAPSVATFSLVSR* 

TGACCAACAGCCTGCACTTAAGGTGCGCTGGAAGTGTTGGATTTGGTCTCAGTGTGCCAA 

6181 + + + + + + 6240 

ACTGGTTGTCGGACGTGAATTCCACGCGACCTTCACAACCTAAACCAGAGTCACACGGTT 

ATATCCTTTTAGGCGATGTACAGGAGTCTAGTTTAGTGTGTCTTTGGGGGATGACGGGAG 

6241 + + + + * + 6300 

TATAGGAAAATCCGCTACATGTCCTCAGATCAAATCACACAGAAACCCCCTACTGCCCTC 

CGACTAGGTTTAGGACTGTAGCTGCTATGTAAGTCGTGCATGCGGCATTGTGCGTAAGAC 

6301 + + + + + + 6360 

GCTGATCCAAATCCTGACATCGACGATACATTCAGCACGTACGCCGTAACACGCATTCTG 

GTGCATGCATTTGGGCGAGTGCCCTAGGGCAGCGTCGGTCAGGTGACTAGCAGCCGGCTC 

6361 + + + + + + 6420 

CACGTACGTAAACCCGCTCACGGGATCCCGTCGCAGCCAGTCCACTGATCGTCGGCCGAG 

TACGGAGCGCTGAAAGTGCTAGGTCCTGAAGGTACAGTTGGGCTGAGGOVGGACATGGTT 

6421 + + + + + + 6480 

ATGCCTCGCGACTTTCACGATCCAGGACTTCCATGTCAACCCGACTCCGTCCTGTACCAA 

GAACGAGTTGACCGTGGGGACCAGCGGCGGTGACTCGGGCCGTAGCCACGCGCGGGGCGG 

6481 + + + 4- + + 6540 

CTTGCTCAACTGGCACCCCTGGTCGCCGCCACTGAGCCCGGCATCGGTGCGCGCCCCGCC 

CAGGGCGTCTCGTGGTGTATCTGGGCAAGATACGGCTTTATTAGGCACCATAATATGGAG 

6541 + + +- + — + + 6600 

GTCCCGCAGAGCACCACATAGACCCGTTCTATGCCGAAATAATCCGTGGTATTATACCTC 

CCCAAAGCGTCGGGGTCGGGAAACATCTCCATAGCTTAGTGGCAGCAGCCTAAGATAGGC 

6601 + -r 4- 4- + 4- 6660 

GGGTTTCGCAGCCCCAGCC CTTTGTAGAGGTATCGAATCAC CGTCGTCGGATTCTATCCG 

TGGGAGGCCCGTTCCCTGTAGTAGTGGTGGGTTAGCATGCCACTAAGCGGTGCGGCGTGA 

6661 + + 4- 4- + •+> 6720 

ACCCTCCGGGCAAGGGACATCnTCACCACCCAATCGTACGGTGATTCGCCACGCCGCACT 



WO 97/22700 



121 



PCT7US96/20747 



liCCGTCCGTAGTTAGGCGACCCGTGTTTTAATAGGGTCTCTTTAGTTAAGT 



TAAGGCGCCACCG7CCGTAGTTAb«v.fc«i._ ^ - *~ _______ + e 78 0 

6721 A^CCS^raC^TcL^C^ 

ttakcatgtcgtacactt^ 6a40 

AATCCGTACAGCATGTCAATCCTAAAG^ 

TAGTTTAGATGTACATTATTACGTACKT^ ^ 

6841 a^caaatctacatct^tIatc^ 

TTGTGTGTAGCCTTTAATGTAGGTTTCTTTGTTTTATTTTTGCCTCTC^ 

6901 Z£^r£^ 

TCTTTTCTTCTATTTAGGTTTATCTTCTTTCCTTAGTG^GTCGTATATCA^ ^ 

6961 agaaaagaagataaa^ccaaI^agaIgaa! 

CAAATTATGAATTTTCCTTCGTGTAGGCGTCG^ ^ 
OT^AATACT^AAAAKAAG^^ 

gtttagtggcgacataaataggtttttgckgagattggga^ 7i4q 

7081 ^^ccg^gtIt^atccaaaaa^ 

aagagaaatcggggaac^gccacgcgaatgacc^cgtgctga^g^^ 72qo 

OR?3 (5K, Membrane protoia) 

GATTTTATATTGAAGTAGGCGTATTTCTTTATGGATGATCT 7260 

7201 waaaa^aaotcatccSaaZca^^ _ 



7021 



CrAGTAGTCGATTTTGTCTTCGTGATAATTCTGCTGCTK _____ 

gIt^gctaaaacag^ _ 

LVVDFVFVIILLLVLTFVVP 

jwrnaaxc*^^ 7380 

TCCAATGTCGTCTCGAGGTGGTAACTATGTCCA^ 
RLQQSSTINTGLRTV 



7321 



rvavJSlcSS^^ 7440 

7381 . 

MEVGIDFGTTFSTICFSP 
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TCTGGGGTCAGCGGTTGTACCCCTGTGGCCGGTAGTGTTTACGTTGAAACCCAAATTTTT 

7441 * + + + * 7500 

AGACCCCAGTCGCCAACATGAGGACACCGGCCATCACAAATGCAACTTTGGGTTTAAAAA 

SGVSGCTPVAGSVYVETQI? 

AT\CC m GAAGGTAGCAGTACTTACTTAATTGGTAAAGCTGCGGGGAAAGCTTATCGTGAC 

7501 —1—2 + + + + — * + 7560 

TATGGACTTCCATCGTCATGAATGAATTAACCATTTCGACGCCCCTTTCGAATAGCACTG 
IPEGS STYLIGKAAGKAYRD - 

GGTGTAGAGGGAAGGTTGTATGTTAACCCGAAAAGGTGGGCAGGTGTGACGAGGGATAAC 

7561 - + + + + + 7620 

CCACATCTCCCTTCCAACATACAATTGGGCTTTTCCACCCGTCCACACTGCTCCCTATTG 

GVEGRLYVNPKRWAGVTRDK 

GTCGAACGCTACGTCGAGAAATTAAAACCTACATACACCGTGAAGATAGACAGCGGAGGC 

7621 * + + + + + 7680 

CAGCTTGCGATGCAGCTCTTTAATTTTGGATGTATGTGGCACTTCTATCTGTCGCCTCCG 

VERYVEKLKPTYTVKI 0SGG 

GCCTTATTAATTGGAGGTTTAGGTTCCGGACCAGACACCTTATTGAGGGTCGTTGACGTA 

7681 * + + + + + 7740 

C GG AATAATTAAC CTCC AAATCC AAGGCCTGGTCTGTGGAATAACTCCCAGCAACTGCAT 
ALLIGGL6SG PDTLLRVVDV - 

ATATGTTTATTCTTGAGAGCCTTGATACTGGAGTGCGAAAGGTATACGTCTACGACGGTT 

7741 ~ + + *— - * + — * 7800 

^ATACAAATAAGAACTCTCGGAACTATGACCTCACGCTTTCCATATGCAGATGCTGCCAA 

ICLFLRALILECE RYTSTTV - 

ACAGCAGCTGTTGTAACGGTACCGGCTGACTATAACTCCTTTAAACGAAGCTTCGTTGTT 

7801 + + + — * + + 7860 

TGTCGTCGACAACATTGCCATGGCCGACTGATATTGAGGAAATTTGCTTCGAAGCAACAA 

TAAVVTVPADYNSFKRSFVV 

GAGGCGCTAAAAGGTCTTGGTATACCGGTTAGAGGTGTTGTTAACGAACCGACGGCCGCA 

7861 * + + + + + 7920 

CTCCGCGATTTTCCAGAACCATATGGCCAATCTCCACAACAATTGCTTGGCTGCCGGCGT 

EALKGLGIPVRGVVNEPTAA - 

GCCC^CTATTCCTTAGCTAAGTCGCGAGTAGAAGACCTATTATTAGCGGTTTTTGATTTT 

7921 — + * + + + + 7980 

CGGGAGATAAGGAATCGATTCAGCGCTCATCTTCTGGATAATAATCGCCAAAAACTAAAA 

ALYSLAKSRVEDLLLAVFDF - 

GGGGGAGGGACTTTCGACGTCTCATTCGTTAAGAAGAAGGGAAATATACTATGCGTCATC 

7981 + * + - * * 8040 

CCCCCTCCCTGAAAGCTGCAGAGTAAGCAATTCTTCTTCCCTTTATATGATACGCAGTAG 

GGGTFDVSFVKKKGNILCVI - 

TTTTCAGTGGGTGATAATTTCTTGGGTGGTA 

8041 * - + * + " + 8100 

AAAAGTCACCCACTATTAAAGAACCCACCATCTCTATAACTATCTCGATAGCACCTTCAA 

FSVGDNFLGGRDIDRAIVEV - 
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ATCAAACAA^GATCAAAGGAAAGGCGTCTGATGCG^GTTAGGGATATTCGT-VTCCTCG 

8101 + + ~ + : + 8160 

TAGTTTGTTTTCTAGTTTCCTrTCCGCAGACTACGGTTCAATCCCTATAAGCATAGGAGC 
IKQKIKGXASDAKLGIFVSS 

ATGAAGGAAGACTTGTCTAACAATAACGCTATAACGCAACACCTTATCCCCGTAGAAGGG 

8161 + + + + + 8220 

TACTTCCTTCTGAACAGATTGTTATTGCGATATTGCGTTG7GGAATAGGGGCATCTTCCC 
MKEDLSNNNAITQHLIPVEG 

GGTGTGGAGGTTGTGGATTTGACTAGCGACGAACTGGACGCAATCGTTGCACCATTCAGC 

8221 + + + + — + ♦ 8280 

CCACACCTCCAACACCTAAACTGATCGCTGCTTGACCTGCGTTAGCAACGTGGTAAGTCG 
GVEVVDLTSDELDAIVAPFS 

GCTAGGGCTGTGGAAGTATTCAAAACTGGTCTTGACAACTTTTACCCAGACCCGGTTATT 

8281 + + + + + 8340 

CGATCCCGACACCTTCATAAGTTTTGACCAGAACTGTTGAAAATGGGTCTGGGCCAATAA 
AR'AVEVFKTGLDNFYPDPV'I 

GCCGTTATGACTGGGGGGTCAAGTGCTCTAGTTAAGGTCAGGAGTGATGTGGCTAATTTG 

8341 + + + + + +'8400 

CGGCAATACTGACCCCCCAGTTCACGAGATCAATTCCAGTCCTCACTACACCGATTAAAC 
AVMTGGS SALVKVRSDVANL 

CCGCAGATATCTAAAGTCGTGTTCGACAGTACCGATTTTAGATGTTCGGTGGCTTGTGGG 

8401 + + + + + + 8460 

GGCGTCTATAGATTTCAGCACAAGCTGTCATGGCTAAAATCTACAAGCCACCGAACACCC 
PQISKVVFDSTDFRCSVACG 

GCTAAGGTTTACTGCGATACTTTGGCAGGTAATAGCGGACTGAGACTGGTGGACACTTTA 

8461 + + + + + + 6520 

CGATTCCAAATGACGCTATGAAACCGTCCATTATCGCCTGACTCTGACCACCTGTGAAAT 
AKVYCDTLAGNSGLRLVDTL 

ACGAATACGCTAACGGACGAGGTAGTGGGTCTTCAGCCGGTGGTAATTTTCCCGAAAGGT 

8521 + + + + + — + 8580 

TGCTTATGCGATTGCCTGCTCCATCACCCAGAAGTCGGCCACCATTAAAAGGGCTTTCCA 
TNTLTDEVVGLQPVVIFFKG 

AGTCCAATACCCTGTTCATATACTCATAGATACACAGTGGGTGGTGGAGATGTGGTATAC 

8581 + + + + + + 8640 

TCAGGTTATGGGACAAGTATATGAGTATCTATGTGTCACCCACCACCTCTACACCATATG 
SPIPCSYT'HRYTVGGGDVVY 

GGTATATTTGAAGGGGAGAATAACAGAGCTTTTCTAAATGAGCCGACGTTCC 

8641 + + + + + + 8700 

CCATATAAACTTCCCCTCTTATTGTCTCGAAAAGATTTACTCGGCTGCAAGGCCCCGCAT 
GIFEGENNRAFLNE PTFRGV 

TCGAAACGTAGGGGAGACCCAGTAGAGACCGACGTGGCGCAGTTTAATCTCTCCACGGAC 

8701 + + + + + + 8760 

AGCTTTGCATCCCCTCTGGGTCATCTCTGGCTGCACCGCGTCAAATTAGAGAGGTGCCTG 
SKRRGDPVETDVAQ FNLSTD 



WO 97/22700 



124 



PCTAJS96/20747 



GGAACGGTGTCTGTTATCGTTAATGGTGAGGAAGTAAAGAATGAATATCTGG7ACCCGGG 

3751 ^ + ^ + + -r 8820 

CCTTGCCACAGACAATAGCAATTACCACTCCTTCATTTCTTACTTATAGACCATGGGCCC 
a GTVSVIVNGEEVENEYLVPG 

ACAACAAACGTACTGGATTCATTGGTCTATAAATCTGGGAGAGAAGATTTAGAGGCTAAG ' 

8821 * + + + + -+ 8880 

TGTTGTTTGCATGACCTAAGTAACCAGATATTTAGACCCTCTCTTCTAAATCTCCGATTC 
a TTNVLDSLVYKSGREDLEAK 



GCAATACCAGAGTACTTGACCACACTGAATATTTTGCACGATAAGGCTTTCACGAGGAGA 

8881 + + + + + + 8940 

CGTTATGGTCTCATGAACTGGTGTGACTTATAAAACGTGCTATTCCGAAAGTGCTCCTCT 
a AIPEYLTTLNILHDKAFTRR 

AACCTGGGTAACAAAGATAAGGGGTTCTCGGATTTAAGGATAGAAGAAAATTTTTTAAAA 

8941 + + + -r 9000 

TTGGACCC^TTGTTTCTATTCC CCAAGAGC CTAAATTCCTATCTTCTTTTAAAAAATTTT 
a NLGNKDKGFSDLRIEENFLK 

ORF5 (HSP90 Homolog) 

TCCGCCGTAGATACAGACACGATTTTGAATGGATAAATATATTTATGTAACGGGGATATT 

9001 + + + + + + 9060 

AGGCGGCATCTATGTCTGTGCTAAAACTTACCTATTTATATAAATACATTGCCCCTATAA 
a SAVDTDTI LNG* 

b MDKYIYVTGIL - 



AAACCCTAACGAGGCTAGAGACGAGGTATTCTCGGTAGTGAATAAGGGATATATTGGACC 

9061 + + + + + + 9120 

TTTGGGATTGCTCCGATCTCTGCTCCATAAGAGCCATCACTTATTCCCTATATAACCTGG 
b NPNEARDEVFSVVNKGYIGP- 

GGGAGGGCGCTCCTTTTCGAATCGTGGTAGTAAGTACACCGTCGTCTGGGAAAACTCTGC 

9121 + + +■ + + + 9180 

CCCTCCCGCGAGGAAAAGCTTAGCACCATCATTCATGTGGCAGCAGACCCTTTTGAGACG 
b GGRSF S NRG S K YTVVWEN S A 

TGCGAGGATTAGTGGATTTACGTCGACTTCGCAATCTACGATAGATGCTTTCGCGTATTT 

9181 + + + + «■ + 9240 

ACGCTCCTAATCACCTAAATGCAGCTGAAGCGTTAGATGCTATCTACGAAAGCGCATAAA 
b ARISGFTSTSQSTIDAFAYF 

CTTGTTGAAAGGCGGATTGACTACCACGCTCTCTAACCCAATAAACTGTGAGAATTGGGT 

9241 + + — + + + 9300 

GAACAACTTTCCGCCTAACTGATGGTGCGAGAGATTGGGTTAT 
b LLKGGLTTTLSNPINCENWV- 

CAGGTCATCTAAGGATTTAAGCGCGTTTTTCAGGACCCTAATTAAA^ 

9301 + + + + + + 9360 

GTCCAGTAGATTCCTAAATTCGCGCAAAAAGTCCTGGGATTAATTTCCATTCTAAATACG 
b RSSKDLSAFFRTLIKGKIYA- 

ATCGCGTTCTG7GGACAGCAATCTTCCAAAGAAAGACAGGGATGACATCATGGAAGCGAG 

9361 + + + + + + 9420 

TAGCGCAAGACACCTGTCGTTAGAAGGTTTCTTTCTGTCCCTACTGTAGTACCTTCGCTC 
b SRSVDSNLPKKDRDDIMEAS- 
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TCGACGACTATCGCC\TCGGACGCCGCCTTTTGCAGAGCAGTGTCGGTTCAGGTAGGGAA 

9 42^ + + + + * 9480 

AGC^GCTGATAGCGGTAGCCTGCGGCGGAAAACGTCTCGTCACAGCCAAGTCCATCCCTT 

RRLSPSDAAFCRAVSVQVGK- 

■ GTATGTGGACGTAACGCAGAATTTAGAAAGTACGATCGTGCCGTTAAGAGTTATGGAAAT 

9481 + + * + * + 9540 

CATACACCTGCATTGCGTCTTAAATCTTTCATGCTAGCACGGCAATTCTCAATACCTTTA 
YVDVTQNLESTIVPLRV MEI- 

AAAGAAAAGACGAGGATCAGCACATGTTAGTTTACCGAAGGTGGTATCCGCTTACGTAGA 

9541 + + + + + + 9600 

TTTCTTTTCTGCTCCTAGTCGTGTACAATCAAATGGCTTCCACCATAGGCGAATGCATCT 
KKRRGSAHVSLPKVVSAYVD- 

TTTTTATACGAACTTGCAGGAATTGCTGTCGGATGAAGTAACTAGGGCCAGAACCGATAC 

9601 + + + - + + 9660 

AAAAATATGCTTGAACGTCCTTAACGACAGCCTACTTCATTGATCCCGGTCTTGGCTATG 
FYTNLQELLSDEVTKARTDT- 

AGTTTCGGCATACGCTACCGACTCTATGGCTTTCTTAGTTAAGATGTTACCCCTGACTGC 

9661 + * + - + 9720 

TCAAAGCCGTATGCGATGGCTGAGATACCGAAAGAATCAATTCTACAATGGGGACTGACG 
VSAYATDSMAFLVKMLPLTA 

TCGTGAGCAGTGGTTAAAAGACGTGCTAGGATATCTGCTGGTACGGAGACGACCAGCAAA 

9721 * + + + ~ "+ 9780 

AGCACTCGTCACCAATTTTCTGCACGATCCTATAGACGACCATGCCTCTGCTGGTCGTTT 

REQWLKDVLGYLLVRRRPAN- 

TTTTTCCTACGACGTAAGAGTAGCTTGGGTATATGACGTGATCGCTACGCTCAAGCTGGT 

9 78 1 + + + + * * 9840 

AAAAAGGATGCTGCATTCTCATCGAACCCATATACTGCACTAGCGATGCGAGTTCGACCA 
FSYDVRVAWVYDVIATLKLV- 

CATAAGATTGTTTTTCAACAAGGACACACCCGGGGGTATTAAAGACTTAAAACCGTGTGT 

9841 + + + + * + 9900 

GTATTCTAACAAAAAGTTGTTCCTGTGTGGGCCCCCATAATTTCTGAATTTTGGCACACA 
IRLFFNKDTPGGIKDLKPCV- 

GCCTATAGAGTCATTCGACCCCTTTCACGAGCTTTCGTCCTATTTCTCTAGGTTAAGTTA 

9901 + + + + * 9960 

CGGATATCTCAGTAAGCTGGGGAAAGTGCTCGAAAGCAGGATAAAGAGATCCAATTCAAT 

P I ESFD P F HELS SYF SR LSY 

CGAGATGACGACAGGTAAAGGGGGAAAGATATGCCCGGAGATCGCCGAGAAGTTGGTGCG 

9961 * + * + + 10020 

GCTCTACTGCTGTCCATTTCCCCCITTCTATACGGGCCTCTAGCGGCTCTTCAACCACGC 

EMTTGKGGKICPEIAEKLVR- 

CCGTCTAATGGAGGAAAACTATAAGTTAAGATTGACCCCAGTGATGGCCTTAATAATTAT 

X0021 * + * * + + 10080 

GGCAGATTACCTCCTTTTGATATTCAATTCTAACTGGGGTCACTACCGGAATTATTAATA 

RLMEENYKLRLTPVMALIII- 
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ACTGGTATACTACTCCATTTACGGCACAAACGCTACCAGGATTAAAAGACGCCCGGATTT 

10081 + ^ + + + 10140 

TGACCATATGATGAGGTAAATGCCGTGTTTGCGATGGTCZTAATTTTCTGCGGGCCTAAA 
b LVYYSI YGTNATRIKRRPDF - 

CCTCAATGTGAGGATAAAGGGAAGAGTCGAGAAGGTTTCGTTACGGGGGGTAGAAGATCG 

10141 + * + + + +.10200 

GGAGTTACACTCCTATTTCCCTTCTCAGCTCTTCCAAAGCAATGCCCCCCATCTTCTAGC 
b LNVRIKGRVEKVS LRGVEDR- 

TGCCTTTAGAATATCAGAAAAGCGCGGGATAAACGCTCAACGTGTATTATGTAGGTACTA 

10201 + + + + + + 10260 

ACGGAAATCTTATAGTCTTTTCGCGCCCTATTTGCGAGTTGCACATAATACATCCATGAT 
b AFRISEKRGINAQRVLCRYY- 

TAGCGATCTCACATGTCTGGCTAGGCGACATTACGGCATTCGCAGGAACAATTGGAAGAC 

10261 + + + + + + 10320 

ATCGCTAGAGTGTACAGACCGATCCGCTGTAATGCCGTAAGCGTCCTTGTTAACCTTCTG 
b SDLTCLARRHYGIRRNNWKT- 

GCTGAGTTATGTAGACGGGACGTTAGCGTATGACACGGCTGATTGTATAACTTCTAAGGT 

10321 + + + + + + 10380 

C GACTCAATAC ATCTGCC CTGCAATCGCATACTGTGCCGACTAACATATTGAAGATTCCA 
b LSYVDGTLAYDTADC ITSKV- 

GAGAAATACGATCAACACCGCAGATCACGCTAGCATTATACACTATATCAAGACGAACGA 

10381 + + + * + + 10440 

CTCTTTATGCTAGTTGTGGCGTCTAGTGCGATCGTAATATGTGATATAGTTCTGCTTGCT 
b RNTINTADHASIIHYIKTNE- 

AAACCAGGTTACCGGAACTACTCTACCACACCAGCTTTAAAGCTGCGTGTAGTATGCGAC 

10441 + + + + * 10500 

TTTGGTCCAATGGCCTTGATGAGATGGTGTGGTCGAAATTTCGACGCACATCATACGCTG 

b NQVTGTTLPHQL* 

GATGTTTCTCGTATTAGTTTTATAAAAATTTTTAATTGCTCTGTGTGTGGTTTTTGTTGA 

10501 + * + + * 10560 

CTACAAAGAGCATAATCAAAATATTTTTAAAAATTAACGAGACACACACCAAAAACAACT 

ORF6 (Coat protein) 

GTGAACGCGATGGCATTTGAACTGAAATTAGGGCAGATATATGAAGTCGTCCCCGAAAAT 

10561 * + + + * * 10620 

CACTTGCGCTAC CGTAAACTTGACTTTAATC CCGTCTATATACTTCAGC AGGGGCTTTTA 
a MAFELKLGQIYEVVPEN 

AATTTGAGAGTTAGAGTGGGGGATGCGGCACAAGGAAAATTTAGTAAGGCGAGTTTC 

10621 + + + + + * 10680 

TTAAACTCTCAATCTCACCCCCTACGCCGTGTTCCTTTTAAATCATTCCGCTCAAAGAAT 

a NLRVR'VGDAAQGKFSKASFL 

AAGTACGTTAAGGACGGGACACAGGCGGAATTAACGGGAATCGCCGTAGTGCCCGAAAAA 

10681 + + + + + + 10740 

TTCATGGAATTCCTGCCCTGTGTCCGCCTTAATTGCCCTT^ 
a KYVKDGTQAELTGIAVVPEK - 
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TACGTATTCGCCACAGCAGCTTTGGCTACAGCGGCGCAGGAGCCACCTAGGCAGCCACCA 

10 741 + + * * + + 10800 

ATGCATAAGCGGTGTCGTCGAAACCGATGTCGCCGCGTCCTCGGTGGATCCG7CGGTGGT 

YVFATAALATAAQEPPRQP? 
GCGCAAGTGGCGGAACCACAGGAAACCGATATAGGGGTAGTGCCGGAATCTGAGACTCTC 

ioaoi «■ + + ; 10860 

CGCGTTCACCGCCTTGGTGTCCTTTGGCTATATCCCCATCACGGCCTTAGACTCTGAGAG 
AQVAEPQETDIGVVPESETL 

ACACCAAATAAGTTGGTTTTCGAGAAAGATCCAGACAAGTTCTTGAAGACTATGGGCAAG 

10861 +- + * + + + 10920 

TGTGGTTTATTCAACCAAAAGCTCTTTCTAGGTCTGTTCAAGAACTTCTGATACCCGTTC 

TPNKLVFEKDPDKFLKTMGK - 

GGAATAGCTTTGGACTTGGCGGGAGTTACCCACAAACCGAAAGTTATTAACGAGCCAGGG 

103 21 + + + + + + 10980 

CCTTATCGAAACCTGAACCGCCCTCAATGGGTGTTTGGCTTTCAATAATTGCTCGGTCCC 

GIALDLAGVTHKPKVINEPG - 

AAAGTATCAGTAGAGGTGGCAATGAAGATTAATGCCGCATTGATGGAGCTGTGTAAGAAG 

109 B1 + + * - + + 

TTTCATAGTCATCTCCACCGTTACTTCTAATTACGGCGTAACTACCTCGACACATTCTTC 
KVSVEVAMKINAALMELCKK 

GTTATGGGCGCCGATGACGCAGCAACTAAGACAGAATTCTTCTTGTACGTGATGCAGATT 

11041 + + + +~ + 11100 

CAATACCCGCGGCTACTGCGTCGTTGATTCTGTCTTAAGAAGAACATGCACTACGTCTAA 

VMGADDAATKTEFFLYVMQI - 

GCTTGCACGTTCTTTACATCGTCTTCGACGGAGTTCAAAGAGTTTGACTACATAGAAACC 

11101 + + + + + + 11160 

CGAACGTGCAAGAAATGTAGCAGAAGCTGCCTCAAGTTTCTCAAACTGATGTATCTTTGG 

actfftssstefkefdyiet - 

GATGATGGAAAGAAGATATATGCGGTGTGGGTATATGATTGCATTAAACAAGCTGCTGCT 

11161 + + + + + + H220 

CTACTAC CTTTCTTCTATATAC GC CACAC C CATATACTAACGT AATTTGTT CGACG ACGA 
DDGKKIYAVWVYDCIKQAAA - 

TCGACGGGTTATGAAAACCCGGTAAGGCAGTATCTAGCGTACTTCACACCAACCTTQVTC 
11221 + + + * * + 11280 

agctgcccaatacttttgggccattccgtcatagatcgcatgaagtgtggttggaagtag 
stgyenpvrqylayftptfi - 

ACGGCGACCCTGAATGGTAAACTAGTGATGAACGAGAAGGTTATGGCACAGCATGGAGTA 

U281 + + * * + 11340 

TGCCGCTGGGACTTACCATTTGATCACTACTTGCTCTTCCAATACCGTGTCGTACCTCAT 

TATLN GKLVMNEKVMAQHGV - 

CCACCGAAATTCTTTCCGTACACGATAGACTGCGTTCGTCCGACGTACGATCTGTTCAAC 

+ + 4- + + — " + 11400 

^TGGC^ 

ppkffpytipcvrptydlfn - 
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AACGACGCAATATTAGCATGGAATTTAGCTAGACAGCAGGCGTTTAGAAACAAGACGGTA 

11401 + + + * * + H460 

TTGCTGCGTTATAATCGTACCTTAAATCGATCTGTCGTCCGCAAATCTTTGTTCTGCCAT 
NDAILAWNLARQQAFP.NKTV 

ACGGCCGATAACACCTTACACAACGTCTTCCAACTATTGCAAAAGAAGTAGCTACGATCG 

11461 + * + + + * H520 

TGCCGGCTATTGTGGAATGTGTTGCAGAAGGTTGATAACGTTTTCTTCATCGATGCTAGC 
TADNTLHNVFQLLQKK* 

ORP7 <CPr) 

ATGTCTATAAATTGGTGAAAAATTTAGAAATATTTACCTTTTATTGATAATTCATGGGAG 

U521 + + + + + + H530 

TACAGATATTTAACCACTTTTTAAATCTTTATAAATGGAAAATAACTATTAAGTACCCTC 

M S I N W * 

M G A - 

C^TATACACATGTAGACTTTCATGAGTCGCGGTTGCTGAAAGACAAACAAGACTATCTTT 

H581 + + + + + - H640 

GAATATGTGTACATCTGAAAGTACTCAGCGCCAACGACTTTCTGTTTGTTCTGATAGAAA 
YTHVDFHE S RLLKDXQDYL S - 

CTTTCAAGTCAGCGGATGAAGCTCCTCCTGATCCTCCCGGATACGTTCGCCCAGATAGTT 

11641 * * + + * * 11700 

GAAAGTTCAGTCGCCTACTTCGAGGAGGACTAGGAGGGCCTATGCAAGCGGGTCTATCAA 

FKSADEAP PDPPGYVRPDSY- 

ATGTGAGGGCTTATTTGATACAAAGAGCAGACTTTCCCAATACTCAAAGCTTATCAGTTA 

117 01 + + + * + + 11760 

TACACTCCCGAATAAACTATGTTTCTCGTCTGAAAGGGTTATGAGTTTCGAATAGTCAAT 

VRAYLIQRADFPNTQSLSVT- 

CGTTATCGATAGCCAGTAATAAGTTAGCTTCAGGTCTTATGGGAAGCGACGCAGTATCAT 

U 761 * — + * + + + 11820 

GCAATAGCTATCGGTCATTATTCAATCGAAGTCCAGAATACCCTTCGCTGCGTCATAGTA 

LS IASNKLASGLMGSDAVS S - 

CGTCGTTTATGCTGATGAACGACGTGGGAGATTACTTCGAGTGCGGCGTGTGTCACAACA 

11821 + * * + + * 11880 

GCAGCAAATACGACTACTTGCTGCACCCTCTAATGAAGCTCACGCCGCACACAGTGTTGT 

S FMLMNDVGDYF ECGVCHNK- 

AACCCTACTTAGGACGGGAAGTTATCTTCTGTAGGAAATACATAGGTGGGAGAGGAGTGG 

11881 + + * + * + H940 

TTGGGATGAATCCTGCCC1TCAATAGAAGACATCC1!TTATGTATCCACCCTCTCCTCA 
PYLGREVIFCRKYIGGRGVE- 

AGATCACCACTGGTAAGAACTACACGTCGAACAATTGGAACGAGGCGTCGTACGTAATAC 

119 41 + - + + + + + 12000 

TCTAGTGGTGACCATTCTTGATGTGCAGCTTGTTAA 

ITTGKNYTSNNWNEASYVIQ- 

AAGTGAACGTAGTCGATGGGTTAGCACAGACCACTGTTAATTCTACTTATACGCAAACGG 

120 01 + + + + + * 12060 

TTCACTTGCATCAGCTACCCAATCGTGTCTGGTGACAATTAAGATGAATATGCGTTTGCC 

VNVVDG LAQTTVN STYTQTD- 
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ACGTTAGTGGTCTACCCAAAAATTGGACGCGTATCTACAAAATAACAAAGATAGTGTCCG 

12061 + + + + 12120 

TGCAATCACCAGATGGGTTTTTAACCTGCGCATAGATGTTTTATTGTTTCTATCACAGGC 
VSGLPKNWTRIYKITKIV-SV- 

TAGATCAGAACCTCTACCCTGGTTGTTTCTCAGACTCGAAACTGGGTGTAATGCGTATAA 

12121 +■ + + + + 12180 

ATCTAGTCTTGGAGATGGGACCAACAAAGAGTCTGAGCTTTGACCCACATTACGCATATT 
D Q N L . Y P G C F S D S K L GVMR I R- 

GGTCACTGTTAGTTTCCCCAGTGCGCATCTTCTTTAGGGATATCTTATTGAAACCTTTGA 

12181 + + + + + + 12240 

CCAGTGACAATCAAAGGGGTCACGCGTAGAAGAAATCCCTATAGAATAACTTTGGAAACT 
SLLVSPVRIFFRDXLLKPL K- 

AGAAATCGTTCAACGCAAGAATCGAGGATGTGCTGAATATTGACGACACGTCGTTGTTAG 

12241 + + + + + + 12300 

TCTTTAGCAAGTTGCGTTCTTAGCTCCTACACGACTTATAACTGCTGTGCAGCAACAATC 
KS FNARI EDVLNIDDTSL LV- 

TACCGAGTCCTGTCGTACCAGAGTCTACGGGAGGTGTAGGTCCATCAGAGCAGCTGGATG 

12301 + + + + + + 12360 

ATGGCTCAGGACAGCATGGTCTCAGATGCCCTCCACATCCAGGTAGXCTCGTCGACCTAC 
PSPVVPESTGGVGPSEQLDV- 

TAGTGGCTTTAACGTCCGACGTAACGGAATTGATCAACACTAGGGGGCAAGGTAAGATAT 

12361 + + + + + + 12420 

ATCACCGAAATTGCAGGCTGCATTCCCTTAACTAGTTGTGATCCCCCGTTCCATTCTATA 
VALTSDVTELINTRGQGKIC- 

GTTTTCCAGACTCAGTGTTATCGATCAATGAAGCGGATATCTACGATGAGCGGTATTTGC 

12421 + + + + + + 12480 

CAAAAGGTCTGAGTCACAATAGCTAGTTACTTCGCCTATAGATGCTACTCGCCATAAACG 
F P DSVL S INEAD I YD ERYL P - 

CGATAACGGAAGCTCTACAGATAAACGCAAGACTACGCAGACTCGTTCTTTCGAAAGGCG 

12481 + + + + + 12540 

GCTATTGCCTTCGAGATGTCTATTTGCGTTCTGATGCGTCTGAGCAAGAAAGCTTTCCGC 
ITEALQINARLRRLVLSKGG- 

GGAGTCAAACACCACGAGATATGGGGAATATGATAGTGGCCATGATACAACTTTTCGTAC 

12541 + + + + + + 12600 

CCTCAGTTTGTGGTGCTCTATACCCCTTATACTATCACCGGTACTATGTTGAAAAGCATG 
SQTPRDMGNMIVAMIQLFVL- 

TCTACTCTACTGTAAAGAATATAAGCGTCAAAGACGGGTATAGGGTGGAGACCGAATTAG 

12601 + + ♦ + + + 12660 

AGATGAGATGACATTTCTTATATTCGCAGTTT^ 

YSTVKNI SVKDGYRVETEL G- 

GTCAAAAGAGAGTCTACTTAAGTTATTCGGAAGTAAGGGAAGCTATATTAGGAGGGAAAT 

12661 + + + + + + 12720 

CAGTTTTCTCTCAGATGAATTCAATAAGCCTTCATTCCCTTCGATAT^ 

QKRVYLSYSEVREAILGGK'Y- 
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a 
c 



ACGGTGCGTCTCCAACCAACACTGTGCGATCCTTCATGAGK3TATTTTGCTCACACCACTA 

12721 "GC^CGCAGAGGTTGGTTGTGACACGCTAGG^ 

*G *AS?"NTVSS FMRYFAHTT i - 

TTACTCTACTTATAGAGAAGAAAATTCAGCCAGCGTGTACTGCCCTAGCTAAGCACGGCG 

12 7£i + + * * ' ~ 12840 

AATGAGATGAATATCTC^^CTTTTAAGTCGGTCGCACATGACGGGATCGATTCGTGCCGC 

TCCCGAAGAGGTTCACTCCGTACTGCTTCGACTTCGCACTACTKATMCAGATA^ACC 

12 8 41 r^cttctcc^gtgaot^tgIcgaaktgaagcgtgatgacctattgtctataa^ 

PKRFTPYC FDFALLDNRYY P- 

cggcggacgtgttgaaggctaacgcaatggcttgcgctatagcgattaaatcagctaatt 

, — — » — — +— — + 12 jot) 

12901 - —4.-- «---- — f 

GCCGCCTGCAC* TVCTTCCGATTGCGTTACCGAACGCGATATCGCTAATTTAGTCGATTAA 
ADVLKA NAMACAIAIKSANL- 

ORF8 

TAAGGCGTAAAGGTTCGGAGACGTATAACATCTTAGAAAG^TTT^ ^ ^ 

129 6 1 ATOCCGCAT^CCAAGCCTCTTCATATTGTAGAATCTC 



R R K G S 



ETYNILESI 



GAATTCAGACCMOTTTJ*^^^ i3q8o 

s'frpvlitvrrdpgvntgsl - 

AAAGTGATAGCTTATGACTTACACTACGACAATATA^CGATAAC^ 
^TCACTATCG^TACTGAATGTGATGCTGT^ 

KVIAYDLHYDNIFDNCAVKS - 
TTTCGAGACACCGACACTGGATTCACTGTTATGA^GAATACTCGAC^ 

13141 a^g^ctgSctgtgIcctaIgtoac^tactttctt^ 



13081 



FRDTDTGFT 



VMKEYSTNSAF 



ATACTAAGTCCTTATAAACTGTTTTCCG^^ 

ILSPYKLFSAVFNKEGEMIS - 
JUCffiOOTUKSMO^^ 13320 

NDVGSSFRVYNIFSQMCKDI 
AACGAGATCAGCGAGATACAACGCGCCGGTTACCTAGAAACATATTOAGGAGA ^ ^ 

13321 . 

NEISEIQRAGYLETYLGDGQ 
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GC m GAC\C"GATATATTTTTTGATGTCTTAACCAACAACAAAGCAAAGGTAAGGTGGTTA 

13381 ----- — -+ + -~ + 13440 

CGACTGTGACTATATAAAAAACTACAGAATTGGTTGTTGTTTCGTTTCCATTCCACCAAT 

a ADTDI FFDVLTN-NKAKVRWL 

GTTAATAAAGACCATAGCGCGTGGTGTGGGATATTGAATGATTTGAAGTGGGAAGAGAGC 

13441 - — * - * + ~+ + 13500 

CAATTATTTCTGGTATCGCGCACCACACCCTATAACTTACTAAACTTCACCCTTCTCTCG 

a VNKDHSAWCGXLNDLKWEES 

AACAAGGAGAAATTTAAGGGGAGAGACATACTAGATACTTACGTTTTATCGTCTGATTAT 

13501 — + + + + + + 13560 

TTGTTCCTCTTTAAATTCCCCTCTCTGTATGATCTATGAATGCAAAATAGCAGACTAATA 

a NKEKFKGRDILDTYVLS SDY - 

ORF9 

C C AGGGTTTAAATGAAGTTGCTTTCGC TCCGCTATCTTATCTTAAGGTTGTCAAAGTCGC 

13561 + + * * + 13620 

GGTC C CAAATTTACTTCAACGAAAGCGAGGCG ATAGAATAGAATTCCAACZAGTTTCAGCG 

a P G F K * 

MKLLSLRYLILRLSKSL- 



c 



TTAGAACGAACGATCACTTGGTTTTAATACTTATAAAGGAGGCGCTTATAAACTATTACA 

13621 + + + + + + 13680 

AATCTTGCTTGCTAGTGAACCAAAATTATGAATATTTCCTCCGCGAATATTTGATAATGT 
RTNDHLVL I L I KEALINYYN- 

ACGCC^CTTTCACCGATGAGGGTGCCGTATTAAGAGACTCTCGCGAAAGTATAGAGAATT 

136 ai + * -— + + * 13740 

TGCGGAGAAAGTGGCTACTCCCACGGCATAATTCTCTGAGAGCGCTTTCATATCTCTTAA 
ASFTDEGAVLRDSRESIENF- 

TTCTCGTAGCCAGGTGCGGTTCGCAAAATTCCTGCCGAGTCATGAAGGCTTTGATCACTA 

13741 + + + + + + 13800 

AAGAGCATCGGTCCACGCCAAGCGTTTTAAGGACGGCTCAGTACTTCCGAAACTAGTGAT 
LVARCGSQNSCRVMKALITN- 

ACACAGTCTGTAAGATGTCGATAGAAACAGCCAGAAGTTTTATCGGAGACTTAATACTCG 

13801 + + + + + + 13860 

TGTGTCAGACATTCTACAGCTATCTTTGTCGGTCTTCAAAATAGCCTCTGAATTATGAGC 

TVCKMS IETARSF IGDLILV- 

TCGCCGACTCCTCTGTTTCAGCGTTGGAAGAAGCGAAATCAATTAAAGATAATTTCCGCT 

13861 + + + * + + 13920 

AGCGGCTGAGGAGACAAAGTCGCAACCTTCTTCGCTTTAGTTAATTTCTATTAAAGGCGA 

ADSSVSALEEAKS IKDNFRL- 

TAAGAAAAAGGAGAGGOIAGTATTATTATAGTGGTGATTGTGGAT^ 

13921 + + + + +~ + 13980 

ATTCTTTTTCCTCTCCGTTCATAATAATATCACCACTAACACCTAGGCTGCAACGCTTTC 

RKRRGKYYYSGDCGSDVA KV- 

TTAAGTATATTTTGTCTGGGGAGAATCGAGGATTGGGGTGCGTAGATTCCTTGAAGCTAG 

13 981 + + + +— + + 14040 

■ AATTCATATAAAACAGACCCCTCTTAGCTCCTAACCCCACGCATCTAAGGAACTTCGATC 

KY ILSG ENRG LGCVDS LKLV- 
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TTTGCGTAGGTAGACAAGGAGGTGGAAACGTACTACAGCACCTACTAATCTCATCTCTGG 

1 4 °41 — + + + - 14100 

AAACGCATCCATCTGTTCCTCCACCTTTGCATGATGTCGTGGATGATTAGAGTAGAGACC 
CVGRQGGGNVLQHLLISSLG- 

ORP10 

G7TAAAGCATCATGGACCTATCGTTTATTATTGTGCAGATCCTTTCCGCCTCGTACAATA 

14101 + + + + + + 14160 

CAATTTCGTAGTACCTGGATAGCAAATAATAACACGTCTAGGAAAGGCGGAGCATGTTAT 
* MDLSFIIVQIL'SASYNN- 

ATGACGTGACAGCACTTTACACTTTGATTAACGCGTATAATAGCGTTGATGATACGACGC 

14161 + + t -t- + + 14220 

TACTGCACTGTCGTGAAATGTGAAACTAATTGCGCATATTATCGCAACTACTATGCTGCG 
DVTALYTL INAYNSVDDTT R — 

GCTGGGCAGCGATAAACGATCCGCAAGCTGAGGTTAACGTCGTGAAGGCTTACGTAGCTA 

14221 + + ■+■ + * + 14280 

CGACCCGTCGCTATTTGCTAGGCGTTCGACTCCAATTGCAGCACTTCCGAATGCATCGAT 
WAAINDP QAEVNVVKAYVAT- 

CTACAGCGACGACTGAGCTGCATAGAACAATTCTCATTGACAGTATAGACTCCGCCTTCG 

14281 + + + + + + 14340 

GATGTCGCTGCTGACTCGACGTATCTTGTTAAGAGTAACTGTCATATCTGAGGCGGAAGC 
TATTELHRT ILIDS IDSAF A - 

CTTATGACCAAGTGGGGTGTTTGGTGGGCATAGCTAGAGGTTTGCTTAGACATTCGGAAG 

14341 + + + + + + 14400 

GAATACTGGTTCACCCCACAAACCACCCGTATCGATCTCCAAACGAATCTGTAAGCCTTC 
YDQVGC LVG IARGLLRHS ED- 

ATGTTCTGGAGGTCATCAAGTCGATGGAGTTATTCGAAGTGTGTCGTGGAAAGAGGGGAA 

14401 + + + + + + 14460 

TACAAGACCTCCAGTAGTTCAGCTACCTCAATAAGCTTCACACAGCACCTTTCTCCCCTT 
VLEVIKSMELFEVCRGKRGS- 

GCAAAAGATATCTTGGATACTTAAGTGATCAATGCACTAACAAATACATGATGCTAACTC 

14461 + + + + + + 14S20 

CGTTTTCTATAGAACCTATGAATTCACTAGTTACGTGATTGTTTATGTACTACGATTGAG 
KRYLGYLSDQCTNKYMMLTQ- 

AGGCCGGACTGGCCGCAGTTGAAGGAGCAGACATACTACGAACGAATCATCTAGTCAGTG 

14521 + + + + «»■ + 14580 

TCCGGCCTGACCGGCGTCAACTTCCTCGTCTGTATGATGCTTGCTTAGTAGATCAGTCAC 
AGLAAVEGADI LRTNHLVSG- 

GTAATAAGTTCTCTCCAAATTTCGGGATCGCTAGGATC 

14581 + + + + + + 14640 

CATTATTCAAGAGAGGTTTAAAGCCCTAGCGATCCTAC^ 

NKFSPNFG IARMLLLTLCCG- 

GAGCACTATAAAAATGTTATGTTGTTCAGCCAGTGTCAAATTTTCAAACGGGTTACA 

14641 + + + + + + 14700 

CTCGTGATATTTTTACAATACAACAAGTCGGTCA^^ 
A L * 
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ATCGC^nC^TATTTGCGCATGTTTGTTAGCGGTGCTAATTGTTAGCTTTTGTAGAAGGCG 
Q1 + + + + + 14760 

ORF11 

ATGAGGCAC^TAGAAAAACCCATCAGAGTAGCGGTACACTATTGCGTCGTGCGAAGTGAC 

14761 + + + + + + 14820 

^CTCCGTGAATCTTTTTGGGTAGTCTCATCGCCATGTGATAACGCAGCACGCTTCACTG 
MRKLEKPIRVAVHYCVVRSD - 

GTTTGTGACGGGTGGGATGTATTTATAGGCGTAACGTTAATCGGTATGTTTATTAGTTAC 

1482 1 + + + + + + 14880 

CAAACACTGCCCACCCTACATAAATATCCGCATTGCAATTAGCCATACAAATAATCAATG 

VCDGWDVFIGVTLIGMFISY 

TATTTATATGCTCTAATTAGCATATGTAGAAAAGGAGAAGGTTTAACAACCAGTAATGGG 

+ + + + + + 14940 

ATA^TATACGAGATTAATCGTATACATC 

YLYALISICRKGEGLTTSNG - 
TAAAAATCCTTCAATAAATTTGAAATAAACAAAAGTAAGAAAAATGAAATAATTAGGCTA ^ 

14941 attttta^^ 

* 

gtctttttgttcgtctttcgcttttgtagaataggttttatttcgaggtaagatgactaa 

15001 + j. + + + + 15060 

^gaaaaa™cag^ 

actctacctcacggtttaatactctgatatttgtaaaattagtccgtaaagtcagatagt ^ ^ ^ 

15061 tca^totagtgcc^ 

gatattatattagtatagtataataaa^ 

15121 + + + + + + 15180 

CTATAATATAATCATATCATATTATTTGCGGTTTTAGGTTAGTTTCAAACCCTGGATC c g 



gggcctcttatgaggctaacttatcgacaataagttaggtccgccac 
cccggagaatactccgattgaatagctgttattcaatccaggcggtg 
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TABLE 5 



Amino Acid Sequence Alignment for Helicases Encoded by 
GLRaV-3, BYV, ~CTV and LIYV, 



KA) la . 

BYV_HEL FTFTNLSANV LLYEAPPGGG KTTTLIXVFC ETFSX-VNSL I LTANXS SHE 

CTVJTEL LTFTNEEHSL TVYEAPPGGG KTHSLVNSYA DYCVK.VSCL WTANKNSQT 

GLRaV3_HEL VGESFKSFEY KCYNAPPGGG KTT MLV DEFVKSPNST ATTTANVGSS 

liyvIhel MVRRPDVNGL KxYNKPPGAG ktttxaklms kdlknkvkcl alsytkvgrl 

CONSENSUS — Y-aPPGaG KTt d-f-k-v— 1 Jc 

BYV HEL EILAKVNRXV LD...EGDTP LQTRDRILTI DSYLMNNR.G LTCKVLYLDE 

CTvIhEL EISQRISNEL MGRKLAAKYV TDAASRVFTV DSYLMNHL . R LTTQLLFIDE 

GLRaV3_HEL EDXNM A VKXR...DPN LEGLNSATTV NSRvA/NFIVR GI1YKRVLVDE 

LIYV_HEL ELIDKLKXDG IEKP...EKY VKTYDSFLMN NDNILEIV NLYCDE 

CONSENSUS E d — — ltv -s — mn ly-DE 



BYV_HEL CFMVHAGAAV 

CTV.HEL CFMVHAGAIG 

GLRaV3_HEL VYMMHQGLLQ 

LIYV — HEL VFMMHAGKFL 

CONSENSUS - f M-HaG 



III 

ACIEFTKCDS AXLFGDSRQI 
AWEFTSCKA WFFGDSKQI 
LGVFATGASE GLFFGDINQI 
TLLTKIAYQN GYCYGDVNQI 

c ffGD— QI 



RYGRCSELDT AVLSDLNRFV 
HYIHRNDLGV SFVADIDAFI 
PFINREKVFR KDCA. .VFV? 
PFXNRDPYTP AYLS..REFF 



IV 

BYVJiEL DDESRVYGEV SYRCFWDVCA 
CTV_HEL QPEHRIYGEV SYRCPWDICE 
GLRaV3_KEL KKESWYTSK SYRCPLDVCY 
liyvIhel RKQDLNYDTY TYRCPLDTCY 

CONSENSUS — e — vY sYRCP-DvC- 



WLSTF YPKTVAT TNLVSAGQSS 

WLSEF YPRHVAT ANVGSIGKSS 

LLSSMTVRGT EKCYPEKWS GKDK.PWRS 
LLSNLKDEMG NIIYAGGVKN VNEVYPTIRS 

-LS-E Yp--V n S 



BYV_KEL MQVREIESVD DVEYSSEFVY 

CTV_HEL VSIEEXNGCD DVPYDKAAKY 

GI*RaV3^HEL LSKRPIGTTD DVAEINADVY 

LIYVJEL LNLFGXNWG EVPVEYNAKY 

CONSENSUS 1 1 d dV Y 

V _ 

BYV.KEL LTVHEAQGET YRKVNLVRTK 

CTV_HEL NTVHEVQGET YKRVRLVRFK 

GLRaV3_HEL MTVHEAQGKT FSDWLFUTK 

LIYV_HEL STVNEAQGCT FSEVNLVRLV 

CONSENSUS -TVMaQG-T V-LvR-k 



LTMLQSEKXD LLKSFGK. -R SRSSVEKPTV 
IVYTQAEKND LQKHLGRLTV GRNKV.VPIV 
LCMTQLEKSD MKRSLKGKGX . ETP . . . . .V 
LTFTQDEKLN LQRHIDSQGG CRN A V 

1 — tQ-EK-d 1 1 r V 

VI 

FQEDDPFRSE NHITVALSRH VESLTYSVLS 
YQEDTPFSSK NHIWALTRH VDSLVYSVLT 
KADDSLFTKQ PHILVGLSRH TRSLVYAALS 
QFDNPVMSDI NQFWAISRH TTTFKYrTPH 

— d— £ nhi-ValsRH — sl-Y--l- 



BYV_HEL SKRDDAIAQA I 

CTV_HEL SRRYDDTATM I 

GLRaV^KEL SXLDDKVGTY I 

LIYV_HEL SRLNDRVSNA X 



CONSENSUS S D-v — 



- I 
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TABLE 6 





Heiicase 


RdRp 


p5K 




HSP70 


HSP90 


CP 




Virus 


nt 


aa 


Dt 


aa 


nt 


aa 


nt 


aa 


Dt 


aa 


nt 


aa 


BYV 


37.7 


38.4 
(58.1) 


44.5 


412 

(61.0) 


42.0 


30.4 
(47.8) 


43,5 


28.6 
(48.0) 


40.5 


21.7 
(51.0) 


41.5 


20.3 
(43.7) 


crv 


45.3 


36.3 
(55.2) 


44.0 


40.1 
(62.2) 


42.8 


20.0 
(48.9) 


43.7 


28.7 
(49.3) 


38.6 


17.5 
(43.5) 


40.3 


20.5 
(41.9) 


LIYV 


44,9 


32.4 
(53.5) 


46.2 


35,9 
(56.4) 


45.8 


17.9 
(46.2) 


43.9 


282 
(46.9) 


39.3 


16.7 
(36.8) 


36.3 


17.8 
(41.1) 



Nucleotide (nt) and amino acid (aa) sequence similarity was 
calculated from perfect matches after aligning with the GCG 
program GAP; the percentages in parentheses are the 
percentages calculated by the GAP program, which employs a 
matching table based on evolutionary conservation of amino 
acids (Devereux et al . , Wirlpir Acridfi Res., 12:387-395 
(1984) , which is hereby incorporated by reference) . The 
sources for the BYV, CTV, and LIYV sequences were, 
respectively, Agranovsky (1994) , Karasev (1995) , and Klaassen 
(1995), which are hereby incorporated by reference. 



TABLE 7 



Sample # 


Accession # 


ELISA * 


RT-PCR 


Indexing 


1 


476.01 


1.424 (+) 


+ 


+ 


2 


447.01 


0.970 (+) 


+ 


+ 


3 


123.01 


1.101 (+) 


+ 


+ 


4 


387.01 


>1.965 {+) 




+ 


5 


80.01 


>2.020 (+) 






6 


244.01 


>2.000(+) 




+ 


7 


441.01 


>2.000(+) 


+ 


+ 


8 


510.01 


0.857 (+) 




+ 


9 


536.01 


0.561 (+) 






10 


572.01 


>2.000(+) 


+ 




11 


468.01 


>2.000 ( + ) 


+ 


+ 


12 


382-01 


>2.000 ( + ) 




+ 


13 


NY1 


0.656 (+) 


+ 


+ 


14 


Healthy 


0.002 (-) 







Plus (+) and Minus (-) represent positive and negative reactions, 
respectively. For ELISA an OD 405IO that was at least twice higher 
than a healthy control, and more than 0.100 was regarded as 
positive . 
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TABLE 8 

Amino Acid Seauence Alignment of RWA- dependent RNA 
Pol?£eraie (RdRP) of GLRaV-3, BYV, CTV and LIYV. 



BYV_RdRp 
C7V_RdRp 
GLRaV3_RdRp 
LIYV_RdRp 

CONSENSUS 



BYV_RdRp 
CTV_RdRp 
GLRaV3„RdRp 
LIYV_RdRp 



ITTFKLMVXR DAKVKLDSSC LVKHPPAQNI MFKRXAVNAI FSPCFDEFKN 
ISNFKLMVKR DAXVKLDDSS LSXHPAAQNI MFHKKFINAI FSPCFDEFKN 
LTSYTLMVKA DVKPKLDNTP LSKYVTGQNI VYKDRCVTAL FSCIFTACVE 
FKTLNLMVKG ETKPKMDLST YDSYNAPANI VYYQQIVNLY FSPIFLECFA 



f-LMVX- d-K-KlD-s- 1-k- 



-qNI — h vna- FSp-F-e 

IV 

RVTTCTNSNI VFFTEMTNST LASIAXZMLG . SEHVYNVGE IDFSKFDKSQ 
RVLSSLNDNI VFFTEMTNAG LAEIIRRIIG . DDDNLFVGE VDFSKFDKSQ 
RLKYWDERW LFYHGMDTAE LAXALRNNLG .DIRQYYTYE LDISKYDK3Q 
RLTYCLSDKI VLYSGMNTDV LAELIESKLP LGLNAYHTLE IDFSKFDKSQ 



CONSENSUS R- 



-d-i vf M LA Ig y E iDf SKfDKSQ 



BYV_RdRp 
CTV_RdRp 
GLRaV3_RdRp 
LIYV_RdRp 

CONSENSUS 



BYV_RdRp 
CTV_RdRp 
GLRa V3 _RdRp 
LIYV_RdRp 



DAFIKSFERT LYSAFGFDED LLD.VWMQGE YTSNATTLDG QLSFSVDNQR 
DLFIKEYERT LYSEFGFDTE LLD.VWMEGE YRARATTLDG QLSFSVDGQR 
SALMKQVEEL ILLTLGVDRE VLS.TFFCGE YDSWRTMTK ELVLSVGSQR 
GTCFKLYEEM MYKMFGFSPE LYDRDFKYTE YFCRAKA.TC GVDLELGTQR 



— f-K-yE— 

V 

KSGASNTWIG 
RSGGSNTWIG 
RSGGANTWLG 
RTGSPNTWLS 



ly« fGfd-e lid gE Y a-tl 1 — sv QR 



VI 



NSIETLGILS MFYYTNRFKA 
NSLVTLGILS LYYDVSKFDL 
NSLVLCTLLS WLRGLDYSY 
NTLVTLGMML SSYDIDDIDL 



LFVSGDDSLI FSESPIRNSA 
LLVSGDDSLI YSSEKISNFS 
IWSGDDSLI FSRQPLDIDT 
LLVSGDDSLI FSRKHLPNKT 



CONSENSUS rsG — NTW-G Nslvtlg-ls 



-f— 11VSGDDSLI fS n— 



BYV_RdRp 
CTV_RdRp 
GLRa V3 _RdRp 
LIYV_RdRp 

CONSENSUS 



DAMCTELGFE 
SEICLETGFE 
SVLSDNFGFD 
QEINKNFGME 



VII 

TKFLTPSVPY FCSKFFVMTG 
TKFMSPSVPY FCSKFWQTG 
VKIFNQAAPY FCSKFLVQVE 
AKYIEKSSPY FCSKFIVELN 



VIII. 



HDVr FVPDPY KLLVKLGAS - 
NKTCFVPDPY KLLVKLGAP . 
DSLFFVPDPL KLFVKFGAS . 
GKLKVTPDPI RFFEKLSIPI 



•fGfe 



_ K f s-py FCSXF-V fvFDP- kl-vKlga- 



BYV RdRP KDEVDDEF LFEVFTSFRD LTKDLVDERV IELLTHLVHS KYGYESGDTY 

CTV~RdRp "qnkltdve LFELFTSFKD MTQDFGDQW LEKLKLLVEA KYGFASGTTM 

GLRaV3 "~RdRp " 1 KTSDID . L LHEIFQSFVD LSKGFNREDV IQELAKLVTR KYK.HSGWTY 

^SviSp RQroFVNGSV VKZRFISFKD LMKEYDNDVA VXniDEAVCY RYSIPVGCSY 

CONSENSUS d~ 1-E-F-Sr-D 1-kdf-e-v i~l— IV— kY sG-ty 



BYV_RdRp 
CTV_RdRp 
GLRa V3 _RdRp 
LIYV RdRp 



AALCAIHCIR SNFSSFXXLY 
PALCAIKCVR SNFLSFERLF 
SALCVLHVLS ANFSQFCRLY 
AALCYIKCCM SNFVSFRR.IY 



CONSENSUS 



-ALC-iHc-- sNF-sF-rly 
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TABLE S 

Nucleotide Sequence Comparison of GLRaV-3 and LIYV in 
Vicinity of Frameshift 



GUlaV-3_HEL 

CLRaV-3 (ntl 
LIYV (nt) 



sp QSVSDAI,L 

TCg cct CXA TCa CTa tee GAc scT^^e Ac. cgT tc 9 cce ecg ccg ctT GCt £c <L CTa «T 

r t r r r r c r r r r a a- * acT atc acc tta ^ ccc « s * £ « 

LlW_RdRp KT IILDSAFDVY 



TABLE 10 



Amino Acid Sequence Alignment of Small HydroDhobic 
Transmembrane Proteins of GLRaV-3 (p5k) and BYV (dSX) 
CTV (p6K) and LIYV (pSK) . P ' ' 



1 54 

BYV_p7K KDCVLRSVLL LAFGFLIdF IPCLWFTWF VYKQILmTT AQSNEARHHH STW 

LIYV_P5K MST7.T ,7TL MSILVWnFT TT.7TT.7wm SEVNIPNKSR F* . . . 

GLRaV3_p5K KDD HCQAILLLW DFVFVIILLL VLTFWFRLQ QSSTIOTGLS. TV* . . 

CTV_p6K MDC/IQGFLT FLVG1AVKZA F^GLIIIVIT IYRCTTXPVR SASFYUTHJiT V* - . . 

CONSENSUS f il-f Ivi-i s 
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TABLS 11 



Amino Acid Sentience Alignment of ASP7G -related protein 
of GLRaV-3 with BYV (pS5K) , CTV (p€5K) and LIYV (p62K) . 



BYV _p65 . .MWFGLDF G7TFSSVCAY VGEELYLFKQ RDSAYIPTYV FLHSDTQEVA 

CTV_p65 . .MVLLGLDF GTTFSTVAMA TPSELVILXQ SNSSYIPTCL LLHAZPNSVS 

GLRaV3_p59 . . .MEVGIDF GTTFSTICFS PSGVSGCTPV AGSVYVETQI FIPEGSSTYL 

LIYV o62 MRDCXVGLDF GTTFSTVSTL VNNSMYVLRL GDSAYIPTCI AITPGGEAI . 



CONSENSUS GlDF GTTFStv- 



-1—1- 



-S-YipTci f v- 



BYV _p65 FGYDAEVLSN DLSVRGGFYR DLKRWIGCDE ENYRDYLEKL KPHYKTELLK 

CTV_p65 YGYDAEYLAA S.GESGSFYK DLXRWVGCTA KNYQTYLHKL SPSYKVTVKE 

GLRaV3_p59 IG . KAAGKAY RDGVEGRLYV NPKRWVGVTR DNVERYVEXL KFTYTVKM. . 

LIYV_p62 IGGAAEVLSG DDTPHCFFY . DLXRWVGVDD NTFXFAMNKI RPKYVAELVE 



CONSENSUS -G — Ae-1- 



-g-fY- dlKRWvG ny— yl-Kl -P-Y 1- 



BYV_p65 VAQSSKSTVK LDCYSGTVPQ NATLPGLIAT FVKALISTAS EAFKCQCTGV 

CTV_p65 FGTKSVPVPY LSPLNNDLGL SVALPSLIAS YAKSILSDAE RVrNVSCTGV 

GLRaV3_p59 . . .DSGGALL IGGLGSGPDT LLRWDVTCL FLRALILECE RYTSTTVTAA 

LIYV_p62 GEVY LTGXNKGFSI KLSVKQLIKA YIETIVRLLA SSYSLRVTDL 



CONSENSUS 



-IX- 



BYV_p65 
CTV_p65 
GLRaV3_p59 
LIYV_p62 

CONSENSUS 



ICSVPANYNC LQRSFTESCV NLSGYPCVYM VNEPSAAALS ACSRIKGATS 
ICSVPAGYNT LQRAFTQQSI SMSGYSCVYI INEPSAAAYS TLPKLNSADK 
WTVPADYNS FKRSFWZAL KGLGIPVRGV VNEPTAAALY SLAKSRVEDL 
NQSVPADYKN AQRLAARSVL KALSFPCRRI INEPSAAAVY CVSRYPNYNY 



-sVPA-Yn- lqR-f gypc — i -NEPsAAA- 



BYV_pS5 
CTV_p65 
GLRaV3 _p59 
LIYV_p62 

CONSENSUS 



PVLVYDFGGG TFDVSVTSAL NNTFWRASG GDMNLGGRDI DKAFVEHLYN 

YLAVYDFGGG TFDVSXVSVR LPTFAVRSSS GDMNLGGRDI DKXLSDKIYE 

LLAVFDFGGG TFDVSFVKXit GNILCVIFSV GDNFLGGRDI DRAIVEVIKQ 

FL.VYDFGGG TFDVSLIGXY KSYVTVIDTE GDSFLGGRDI DKSIEDYLVG 



-1-VyDFGGG TFDVS f-V— s- GD — LGGRDI Dk- 



BYV_p65 KAQ. . .LFVN YKTDISFLXE SLSKKVSFLN FPWSEQGVR VDVLVNVSEL 

CTV_p65 MAD...FVPQ KEXNVSSLXE ALSLQTDPVK YT.VNHYGI4S ETVSIDQTVL 

GLRaV3_p59 KIKGXASDAK LGIFVSSUKE DLSNNNAITQ HLIFVEGGVE V.VDLTSDEL 

LIYV_p62 KYNIKKVIP- .ATYI*ALIKE E.CNNTNKSI FTILFUDGSV QWEFSKSEL 



CONSENSUS k- 



-IKE -Is- 



-eL 



BYV_p65 
CTV_?65 
GLRaV3_p5 9 
LIYV_p62 



AEVAAPFVER TIXTVKEVY. 
REIASVFIMR TIDILTQV. . 
DAXVAPFSAR AVEVFKTG? 



.EXYCSSMRL EPNVKAKLLM VGGSSYLPGL 
. .KVXSSMFE SQSL- .KLW VGGSSYLPGL 
.DNFYPDPVT A VM TGGSSALVKV 



EKCVRPFVER SIXLUIDVW RNXLTSGV. 



, . . XYM VGGSSLLQPV 



CONSENSUS 



— j.— pFv-R -i-i v k— s °* VGGSS-L 



WO 97/22700 



PCT/US96/20747 



139 



SYV_p65 
CTV _p65 
GLRaV3_p59 

LIYV_p62 QDMTOSYAST KGLTLVADQD MRSAVSYGCS VLKX.LEDNX EIVYIDCNSK 



LSRLSSIPFV DEC.L.VLPD ARAAVAGGCA LYSACLRNDS PMLLVDCAAK 
LDALATVPrV SGI.V.PVED ARTAVARGCA LYSECLEGRS KALLIDCITH 
RSDVANLPQI SXV.VFDSTD FRCSVACGAX VYC3TLAGNS GLHLVDTLTN 



CONSENSUS 



o-v — 



D -R-aVa-Gc- 



— 1-Dc— 



BYV_p65 NLSISSKYCE SIVCVPAGSP IFFTGVRTVN MTGSNASAVY SAALFEGDFV 

CTV_p65 HLSVTTFSAD SVWAAAGS? IPFEGERKLT LRXCVSTSNY QARMFEGDYZ 

GLRaV3_p59 TLTDEWGLQ FWTFPKGS? IPCSYTHRYT V GGGDV VYGIFEGE . . 

LIYV_p62 PLSDISFNCD PEPIIPJCPMS IPYTHTVKtffi KDRPLKT. . . IVNTYEGSNL 



CONSENSUS -Ls d -wi gsp IPf- 



-fEGd- 



H 

BYV_p65 KCRLNXRIFF GDWLGNVGV TGSATRTVTL TLEINVSSVG TISFSLVGPT 
CTV_p65 KVFRNERIYA ASVSLFTLGV NWSVPNDVEM TLVTKVDSMG KVEFYLKGPS 

GLRaV3_p59 NNRAFL NEPTFRGVSX RRGDPVETCV A. QFNLSTDG TVSVTVNGEE 

LIYV_p62 FMPENDWLIS SNXNTTDFAK VGEZY SXVYEYDIDG IITLKIRNEV 

CONSENSUS N-r-f- — V-l e- G f g — 



BYV_p65 GVKXLIGGNA AYDFSSYQLG ERWADLKXK NSDKVXLIHA LTYQPFQRXK 

CTV__p65 GELVNVQGTS HYDYAGMPHP TRKLVRLSDY NVNSAALVLA LTLTREXREX 

GLRaV3_p59 VKNEYLVPGT TNVLDSL VYKSGRE DLEAKAIPEY LTTLNILHDK 

LIYV_p62 TGXMFTLPNS FTXSDNIKPI TFKLTQLSNT D . DLATLTSL LGYKDKNFER 

CONSENSUS 1 d 1— Lt ek 



BYV_p65 LTDGDKALFL KRLTADYRRE ARKFSSY DDAV LNSSELLLGR 

CTV j 65 FLLRT. . .LF DTLLADLRKT A.SLSEYSKK YPITRNDIDV VSSR. . .MGI 

GLRaV3_p59 AFTRRNLGNK DKGFSDLRIE ENFLXS AVDTDTI LNG* 

LXYV_p62 FYG L FNVFTILIKE IDKLGGFKTL YRRLKSHNAN F 

CONSENSUS f 1 dlx-e l--y v 1 



BYV_p65 IIPKILRGSR VEKLDV* 

CTV_p65 WSXVLRGSD LERIPL. 

GI*RaV3_p59 

LIYV_p62 1 



CONSENSUS 
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TABLE 12 

Amino Acid Sequence Alignment of HSP90-like Protein 
(p55) of GLRaV-3 with p61K of CTV, p59 of LIYV and P64 
of BYV. 



BYV_p64 
CTV_p61 
LIYV_P59 
GLRaV3_p55 

CONSENSUS 



115 VGCKFNX Q SVTEFVKKXNGNVAZ P SLVEHCWSL SHSCGZL I N P KDTKRFV 
1 0 8"vGCKFTI^VESTLMSRGH3DFADLAAVEHSWCLSNSCSRLLSSTEID 

131~EGCSFTEQQVVEKYPQVDSLVAXIL YRVCNSLGKLLDLKDFENKN 

114~VDSNLPKOTREDIME. . ASRRLS P SD AAFCRAVS VQ VG KYVDVTQNLES T ' 



vgc- 



.£ v -e a w — sns-g-1 



-d 



BYV_p64 
CTV_p61 
LIYV_P59 
GLRaV3_p55 

CONSENSUS 



SLIFKGKDLAESTDEAIVS . . SSYLDYLSHCLNLYETCNLSSNSGKKSLY 
TLVF TKNFDSNISG. .VT. . TKLETYLSYCISLYKKKCM - KDDDYFKTLI 
ISGF^INT/ QDSPTVADDN. . ES . NDFFRECVNDQRYYSSLSGSKLGKAX 
^PL^VMEIKXHRGSAHVSLPKWSAY^ 



-yl- 



-nl- 



BYV_p64 
CTV_p61 
LIYV_P59 
GLRaV3_p55 

CONSENSUS 



DEFLKKVTDYXi . . .ENSDLEYRSPSDNPLVAGILYDMCFEYNTLKSTYLK 
LPHFNCI^KVL. . . ASLGLFYEKHADNPLLTGMLIEFCLENKVYVSTFKV 
LEANAyiFKIIiLKSASGEFTDlDRLSRNPLAISKFMNLYTNHVTDSETFKS 
SAYATDSMAFLVKMLPLT AREQWLKDVLGYLLVRRRPANFSYDV 



-npl- 



-1— 1c 



BYV_p64 
CTV_p61 
LIYV_PS9 
GLRaV3_p55 

CONSENSUS 



NXESFIXZ^LSLYLPLLSEVFSMNWERPAPDVRLLFELDAAELLLICVPTXN 
NLDNVRI^XVLFVVLTV^^ 

KFEALKSIKTPFASFIKKAFGIR LNFEDSKIFYALPKER 

RVAWVYDVXATLKLVTRLFFNKDT PGG IKDLICPCV? IE S FD P FHELS . . . 

i f d f— d-f« lp — 



BYV_p64 
CTV_p61 
LrYV_P59 
GLRaV3_p55 

CONSENSUS 



MHDST. - .FLYra^YLESYFEDDSNELIKVKVDSLL 
I HDTM VWGNQIRQLEYWESDALDDLSQHVDLRL 
QSDVLSDDrC^^IVMAASrmSDNNYLPERVD^ 
. . SYFSRLSYEMTTGKGGKICPEIAEKL 



-vd— 1 
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TABLE 13 

Amino Acid Seauence Alignment of the HSP90-related 
Protein of GLRaV-3 with BYV p64K, CTV p61K and LIYV 
p59K. 



MTTRFSTPAN YYWGELFRHF FGGQEW KNLMSE AASVSRPRYS 

MSSH HVWGSLFRXF YGEAIW KEYLSE STRNFDERNV 

. .MLNDRXAV TCFQTLLKKS NVKHEMEQTN NYIVNNLADI NHNTFPALAG 
MDK YIYVTGI..L NPNEARDEVT SWNKGYIG? GGRSFSNRGS 

CONSENSUS — w— If— f w -k — f — r — 



BYV_p61 
CTVjpfil 
LIYV_P59- 
GLRaV3 _p55 



BYVjpSl S.DFRFSDGV ILSRXTFGES TGES..FVRE FSLLLTFPKT YEVCKLCGVA 

CTV_p61 SLDHTLSSGV WRRQSLLNA PQG7. -FENE LALLYNSWT NDFVELTGMP 

LIYV_P59 SVRXDFNSDY YISGGQIWS PKDSNAYVKL LIVYLKYCYI N.YSAKTKYP 

GLRaV3_p55 KYTWWEN. . . .SAARISGF TSTSQSTIDA FAYFL LKGGLTTT 

CONSENSUS s f — s s— fv 11 

BYV__p61 MELALNGMN . . RLSDYNVSE FN IV DVKTVGCKFN IQSVTEFVKX 

CTV_p6l LKSLMTGISD RKVPD E LI SV DPHEVGCRFT LNDVESYLMS 

LIYV_P59 PQSLLAVLDY DSFXAKWVKY LDKSLTDYLD DNKTEGCSFT EQQWEXYPQ 

GLRaV3_p55 LSNPINCENW VRSSXDLSAF FRTLIKGKIY ASRSVDSNLP KKDRDDIME . 

CONSENSUS 1 1 d vgc-f- v-e 



BYV_p61 
CTVJP61 
LIYV_P59 
GLRaV3_p55 



INGNVAEPSL VEKCWSLSNS CGELINPKDT KRFVSLIFKG KDLAESTDEA 
RGEDFADLAA VEHSWCLSNS CSRLLSSTEI DANKTLVF.T KNFDSNISG . 

VDSL VAKIL YRVCNS LGXLLDLKDF ENKNISGFEI NTAQDSPTVA 

.ASRRLSPSD AAFCRAVSVQ VGKYVDVTQN LESTIVPLKV MEIKKRRG3A 



.CONSENSUS 



-sns -g-1- 



BYV_p61 IVS . . SSYLD YLSHCLNLYE TCNLSSNSGK KSLYDEFLKH VTDYL...EN 

CTV_p61 -VT. -TKLET YLSYCISLYK KHCM . KDDDY. FNLILPMFNC LMKVL. . - AS 

LIYV_P59 DDN..ES.ND FFRECVNDQR YYSSLSGSKL GKAKLEANAY IFKILLKSAS 

GLRaV3_p55 HVSLPKWSA YVDFYTNLQE LLSDEVTRAR TOTVSAYATD SMAFLVKMLP 

CONSENSUS -v yi — c-nl . L 

SDLEYRSPSD NPLVAGILYD MCFEYNTLKS TYLKNIESFD CFLSLYLPLL 
LGLFYEKHAD NPLLTGMLIE FCLENKVYYS TFKVNLDNVR LFKSKVLPW 
GEFDIDRLSR NPLAISXFMN LYTNHVTDSE TFKSKFEALK SIKTPFASFI 
LT EQWLKDVLGY LLVRRRPANF SYDVRVAWVY 0VIATLKLVT 

CONSENSUS — 1 npl 1-- lc t e i 



BYV_p61 
CTV_p61 
LIYV_P59 
GLRaV3_p55 



r 

BYV_p61 SEVFSMNWER PAPDVRLLFE LDAAELLLKV PTINMHDST. . -FLYKNKLR 

CTV_p61 LTVWDISEPD DPVDERVLIP FDPTDFVLDL PKLNIKDTM. . .WVGNQIR 

LIYV_PS9 ICKAFGIR LN FEDSKIFYAL PKERQSDVLS DDMMVESIVR 

CLRaV3_p55 RLrFNKDTPG GIXDLKPCVP IESFDPFHEL S SYFS 

CONSENSUS — f — d f — d-f — 1 P d r 
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BYV_p61 YLESYFEDDS 

CTV_pSl QLEYWESDA 

LIYV_P59 DAA5F7WSD 

GLRaV3_p55 RLSYEMTTGK 

CONSENSUS -1 



NELIKVKVDS LLTRDNPEL. 
LDDLSQHVDL RLAADNPDL. 
NNYLPERVDR FVTQLLLELF 
GGXICFEIAE KLVRRLMEEN 

vd- -1 el- 



. KLAQRWV GrKCYYG 

■ RVGLRWA GMFVYYG 

PKTKASFPNK IMFGFLHYFA 
YKLRLT.PVM ALIIILVYYS 

-k w -g-l-Yy~ 



BYV_p61 
CTV_pSl 
LIYV_P5 9 
GLRaV3_p55 



II 

VFRTAQTRKV KRDAEYKLPP AL GE FVTNMSGVEE FF.EELQKKM 

VYRCWDRAV ERPTLFRLPQ KLLSQDDGES CSLHMGSVEA LF.NLVQKVN 
FNDTQ ESTIEIEGET LKISLKFITS YLRNAIQSQH 



LSTTNSKR. . 
IYGTNATRIK RRPDFLNVRI KGRVE. 



. KVSLRGVED 



. RAFRISEK 



CONSENSUS vy-t- 



BYV_p6l PSI...SVRR RFCGSLSHEA FSVFKRFGVG FPPITRLNVP VKYSYLNVDY 

CTV_jp 61 KDI...NVRR QFMGRHSEVA LRLYRNLGLR FPPISSVRLP AHHGYLYVDF 

LIYV_?59 PDYADSNIVR LWCNKRSNLA LGYFKSRNIQ LYLYS . . KYP RLLNYMRFDY 

GLRaV3_p56 RGINAQRVLC RYYSDLTCLA RRHYGIRRNN WKTLSYVD. . GTLAYDTADC 



CONSENSUS 



-v-r -fc- 



-s — A 1 f- 



— p Yl— Dy 



3YV _p61 YRHVKRVGLT 

CTV_p61 YKRVPDGAVT 

LIYV_P5 9 FKGLDMGXLT 

GLRaV3_p55 ITSKVRNTIN 

CONSENSUS y 1 



QDELTILSNX EFDVAEMCCE 
ADELESLRQL RSSVDVMCKD 
DEERLSIQTL RCITEDRS . E 
TADHASIIHY IKTNENQVTG 

-de — s e e 



REVALQARRA QR GEKP 

R.VSITPPPF NRLRRGSSRT 
GTLATHNDLN SWILRP. . . . 
TTLPHQL* . . 



BYV_p61 FQGWKGTKNE ISPHARSSIR VKKNNDSLLN ILWKDVGARS QRRLNPLHRK 

CTV_p61 FRGR.GARGA SSRHMSRDVA TSGFNLPYHG RLYSTS* 

LIYV_P59 . 

GLRaV3 _p55 . 

CONSENSUS ■ 



BYV_p61 H* 
. CTV_p61 

LIYV.P59 . . 

GLRaV3_p55 . . 



CONSENSUS 
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TABLE 14 

Nucleotide and Deduced Amino Acid Sequences of the 
Partial HSP90-Related Gene of GLRaV-3 Adapted for 
Cloning into Plant Expression Vectors . 



(5 * primer, 93-224) 
NcoX 
tacttacctagaacc ■ 




2AAGCGAGTCGACGACTA 
AT GGAAGCGAGTCGACG ACTAT CGCCATCG GACGC CGC CTTTTGC AGAGC AGT GT CGG TT 

9404 +- * + * + 

MEASRRL S PSDAAFCRAVSV - 

CAGGTAGGGAAGTATGTGGACG T AACG CAG AATTT AGAAAGTACG ATC GTGCCGTTAAGA 

QVGKYVDVTQNLEST IVPLR - 

GTTATGGAAATAAAGAAAAGACGAGGATCAGCACATGTTAGTTTACCGAAGGTGGTATCC 

VMEIKKRRGSAHVSL PKVVS 

GCTTACGTAGATTTTTATACGAACTTGCAGGAATTGCTGTCGGATGAAGTAACTAGGGCC 

AYVDFYTNLQ ELL SDEVT RA 

AG AAC CG ATACAGTTTCGGC AT ACG CT ACCGACTC TATGG CTTTC TTAGTTAAGATGTTA 

RTDTVSAYATDSMAFLVKML 

CCCCTGACTGCTCGTGAGCAGTGGTTAAAAGACGTGCrAGGATATCTGCTGGTACGGAGA 

PL-TA REQWLKDVLG YLLVRR - 

CG ACC AGCAAATTTTTCCTACG ACG TAAGAGT AGC TTGGG TAT ATGACGT GATCG CTACG 

RPANFSYDVRVAWVYDVIAT 

CTCAAGCTGGTCATAAG ATT GT T 1T TC A ACAAGCACACACCCGGGGGTATTAAAG ACTTA 



LKLVIRLFFNKDTPGGIKDL 
AAACCGTGTGTGCCTATAGAGTCATTCGACCCCTTTCACGAGCTT^ 

KPCVPIESFDPFHELSSYFS - 
AGGTTAAGTTACGAGATGACGACAGGTAAAGGG<^AAAGATATGCCCGGAG 

RLSYEMTTGXGGXICPEIAE - 
AAGTTGGTCC GC C GTCT AATGG AGG AAAACTAT AAGTTAAGATTG ACC CC AGTGATGG CC 

KLVRRLMEEMYKLRLTPVMA 
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IT AAT AATTATAC TG GT ATACT ACTCC ATTTAC GGC AC AAACG CT ACCAGGATTAAAAGA 

LIIILVYYSIYGTNATRIKR 
CGCCCGGATTTCCTCAATGTGAGGATAAAGGGAAGAGTCGAGAAGGTTTCGTTACGGGGG 

RPDFLNVRIKGRVEKVSLRG 
GT AGAAG ATCGTGCCTTTAG AATATCAGAAAAG CG CGGGAT AAAC G CT CAACG TG TATTA 

VEDRAFRISEKRGINAQRVL - 
TGTAGGTACTATAGCGATCTCACATGTCTGGCTAGGCGACATTACGGCATTCGCAGGAAC 

CRYYSDLTC LARRHYG IRRN 
AATTGGAAGACGCTGAGTTATGTAGACGGGACGTTAGCGTATGACACGGCTGATTGTATA 

NWKTLSYVDGTLAYDTADCI 
ACTTCTAAGGTGAGAAATACGATCAACACCGCAGATCACGCTAGCATTATACACTATATC 

TSKVRNTINTADHAS I IHYI 
AAGACGAACGAAAACCAGGTTACCGGAACTACTCTACCACACCAGCTTTAAAGCTGCGTG 

KTNENQVTGTTLPHQL* 
T AGTATGCG ACGATGTTT CT 



h h 10503 

ATCATACGCTGCTACAAAGA^^ 




-3££a££taggagttct 
Ncol 

(3 1 primer, 93-225) 
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TA3L2 15 

Nucleotide and Deduced Amino Acid Sequences of a PCR- 
amDlified Fragment of the GLRaV-3 Genome External and 
Internal Primes are underlined and their orientations 
are indicated by arrows. 



i — oii-uoi ~ "HIT 60 

:~TAGAAGGTTTC-rTrcrGTCCCrACrG-:AGTACCTTCGCTC.-.GC.G^^-^ 



-Gi» 



v'j's JI'L ? X X D ?. D D Z M £ A S * X - 
.„., _„ nKAC3C=GC=r -TTGCAGAGCAGTGTCGG7TCAGGTAGGGAAGTATGTGGA 

61 -1- I- ~ 120 

TAGCGGTAGCCTGCGGCGGA^^CGTCrCGTCACAGCCAAGTCCATCCCTTCATACACCT 

SPSDAAFCRAVSVQVGKYVD- 

rr~> a r^^ ^a^AGAAAG?ACGA7CST^C CGTrAAGAGTTATGGAAATAAAGAAAAG 

121 * (93-25) + H + "* 180 

<-_,,, ~CTTAAA^ r- ^~CATGC^GCACGGCAATTCTCAATACCTTTATTTCTTTTC 

v "t~T n'lestivplrvmeikkr- 

ACGAGGATCAGCACATGTTAGTTTACCGAAGGTGGTATCCGCTTACGTAGATTTTTATAC ^ 
^CCTAGTCGTCTACAA^ 

r'gsakv s l p kvvsayvdf yt - 

GAACT^GCAGGAATTGCTGTCGGATGAAGTAACTAGGGCCAGAACCGATACAGTTTCGG-C 

_ j. ^ + - 300 

2 4 1 C^GAACGTCTAACGA^^ 

NLQELLSDEVTR ARTDTVSA- 

ATACGC-ACCGACTCrATGGC-rrrC-rrAGT-AAGATGrrACCCCTGACTGCTCGTGAGCA 
ata(-g>..ai.^.v. ^ _. „r9T-A0< * 360 

301 tatgcgatggc?gagataccgaaagaatcaa5t^ 
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CLAIMS : 

1. An isolated DNA molecule encoding a protein or 
polypeptide of a grapevine leaf roll virus. 

2. An isolated DNA molecule according to claim 1, 
wherein the protein or polypeptide is selected from a group 
consisting of a helicase, an RNA-dependent RNA polymerase, an 
hsp70- related, an hsp90-related, a coat protein or 
polypeptide, coat protein repeat, ORF8 (p21) , ORF9 (P20) , 
ORF10 (P20) and ORF11 (p7) . 

3. An isolated DNA molecule according to claim 2, 
wherein the protein or polypeptide is a helicase having a 
molecular weight of from about 146 to about 151 kDa, 

4 . An isolated DNA molecule according to claim 3 , 
wherein the protein or polypeptide comprises an amino acid 
sequence corresponding to SEQ ID NO:2. 

5. An isolated DNA molecule according to claim 4, 
wherein the DNA molecule comprises a nucleotide sequence 
corresponding to SEQ ID NO:l. 

6. An isolated DNA molecule according to claim 2, 
wherein the protein or polypeptide is an RNA-dependent RNA 
polymerase having a molecular weight of from about 59 to about 
63 kDa. 

7. An isolated DNA molecule according to claim 6, 
wherein the protein or polypeptide comprises an amino acid 
sequence corresponding to SEQ ID NO: 4. 



8. An isolated DNA molecule according to claim 7, 
wherein the DNA molecule comprises a nucleotide sequence 
corresponding to SEQ ID NO: 3. 
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9. An isolated DNA molecule according to claim 2, 
wherein the protein or polypeptide is an hs P 70-related protein 
or polypeptide having a molecular weight of from about 57 to 
about 61 kDa. 

10. An isolated DNA molecule according to claim 9, * 
wherein the protein or polypeptide comprises an amino acid 
sequence corresponding to SEQ ID NO: 6. 

11. An isolated DNA molecule according to claim 10, 
wherein the DNA molecule comprises a nucleotide sequence 
corresponding to SEQ ID NO: 5. 

12. An isolated DNA molecule according to claim 2, 
wherein the protein or polypeptide is an hsp90-related protein 
or polypeptide having a molecular weight of from about 53 to 
about 57 kDa. 

13. An isolated DNA molecule according to claim 12, 
wherein the protein or polypeptide comprises an amino acid 
sequence corresponding to SEQ ID NO: 8. 

14. An isolated DNA molecule according to claim 13, 
wherein the DNA molecule comprises a nucleotide sequence 
corresponding to SEQ ID NO: 7. 

15. An isolated DNA molecule according to claim 2, 
wherein the protein or polypeptide is a coat protein or 
polypeptide having a molecular weight of from about 33 to 
about 4 3 kDa . 

16. An isolated DNA molecule according to claim 15, 
wherein the protein or polypeptide comprises an amino acid 
sequence corresponding to SEQ ID NO: 10. 
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17. An isolated DNA molecule according to claim 16, 
wherein the DNA molecule comprises a nucleotide sequence 
corresponding to SEQ ID NO: 9. 

18. An isolated DNA molecule according to claim 1 # 
wherein the protein or polypeptide comprises an amino acid 
sequence corresponding to SEQ ID NO: 12. 

19. An isolated DNA molecule according to claim 18, 
wherein the DNA molecule comprises a nucleotide sequence 
corresponding to SEQ ID NO: 11. 

20. An isolated DNA molecule according to claim 1, 
wherein the protein or polypeptide comprises an amino acid 
sequence corresponding to SEQ ID NO: 14. 

21. An isolated DNA molecule according to claim 20, 
wherein the DNA molecule comprises a nucleotide sequence 
corresponding to SEQ ID NO: 13. 

22. An isolated DNA molecule according to claim 1, 
wherein the protein or polypeptide comprises an amino acid 
sequence corresponding to SEQ ID NO: 16. 

23. An isolated DNA molecule according to claim 22, 
wherein the DNA molecule comprises a nucleotide sequence 
corresponding to SEQ ID NO: 15. 

24. An isolated DNA molecule according to claim 1, 
wherein the protein or polypeptide comprises an amino acid 
sequence corresponding to SEQ ID NO: 18. 

25. An isolated DNA molecule according to claim 24, 
wherein the DNA molecule comprises a nucleotide sequence 
corresponding to SEQ ID NO: 17. 
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26. An expression system comprising a DNA segment 
corresponding to a DNA molecule according to any of claims 1 
to 25 in a vector heterologous to the DNA molecule and wherein 
a nucleotide sequence encoding a protein or polypeptide of a 
grapevine leafroll virus is operatively linked to nucleotide 
sequences which direct the expression of said protein or 
polypeptide . 



27. A host cell transformed with a heterologous 
expression system according to claim 26 . 

28. A host cell according to claim 27, wherein the host 
cell is selected from the group consisting of Agrobacteriu/n 
vitis and Agrobacterium tumefaciens. 

29. A host cell according to claim 28, wherein the host 
cell is a grape cell or a citrus cell. 

30. A transgenic Vitis scion cultivar or rootstock 
cultivar comprising a DNA sequence encoding a protein or a 
polypeptide of a grapevine leafroll virus according to any of 
claims 1-25. 

31. A transgenic Vitis scion cultivar or rootstock 
cultivar according to claim 30, wherein the Vitis scion or 
rootstock cultivar is a scion cultivar selected from the group 
consisting of Vitis vinifra, Alden, Almeria, Anab-E-Shahi, 
Autumn Black, Beauty Seedless, Black Corinth, Black Damascus, 
Black Malvoisie, Black Prince, Blackrose, Bronx Seedless, 
Burgrave, Calmeria, Campbell Early, Canner, Cardinal, Catawba, 
Christmas, Concord, Dattier, Delight, Diamond, Dizmar, 
Duchess, Early Muscat, Emerald Seedless, Emperor, Exotic, 
Ferdinand de Lesseps, Fiesta, Flame seedless, Flame Tokay, 
Gasconade, Gold, Himrod, Hunisa, Hussiene, Isabella, Italia, 
July Muscat, Khandahar, Katta, Kourgane, Kishmishi, Loose 
Perlette, Malaga, Monukka, Muscat of Alexandria, Muscat Flame, 
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Muscat Hamburg, New York Muscat, Niabell, Niagara, Olivette 
blanche, Ontario, Pierce, Queen, Red Malaga, Ribier, Rish 
Baba, Romulus, Ruby Seedless, Schuyler, Seneca, Suavis (IP 
365) , Thompson seedless, Aleatico, Alicante Bouschet, Aligote, 
Alvarelhao, Aramon, Baco blanc (22A) , Burger, Cabernet franc, 
Cabernet, Sauvignon, Calzin, Carignane, Charbono, Chardonnay, 
Chasselas dore, Chenin blanc, Clairette blanche, Early 
Burgundy, Emerald Riesling, Feher Szagos, Fernao Pires, Flora, 
French Colombard, Fresia, Furmint, Gamay, Gewurztraminer , 
Grand noir, Gray Riesling, Green Hungarian, Green Veltliner, 
Grenache, Grillo, Helena, Inzolia, Lagrein, Lambrusco de 
Salamino, Malbec, Malvasia bianca, Mataro, Melon, Merlot, 
Meunier, Mission, Montua de Pilas, Muscadelle du Bordelais, 
Muscat blanc, Muscat Ottonel, Muscat Saint-Vallier , Nebbiolo, 
Nebbiolo fino, Nebbiolo Lampia, Orange Muscat, Palomino, Pedro 
Ximenes, Petit Bouschet, Petite Sirah, Peverella, Pinot noir, 
Pinot Saint-George, Primitivo di Gioa, Red Veltliner, Refosco, 
Rkatsiteli, Royalty, Rubired, Ruby Cabernet, Saint -Emilion, 
Saint Macaire, Salvador, Sangiovese, Sauvignon blanc, 
Sauvignon gris, Sauvignon vert, Scarlet, Seibel 5279, Seibel 
9110, Seibel 13053, Semillon, Servant, Shiraz, Souzao, Sultana 
Crimson, Sylvaner, Tannat, Teroldico, Tinta Madeira, Tinto 
cao, Touriga, Traminer, Trebbiano Toscano, Trousseau, 
Valdepenas, Viognier, Walschriesling, White Riesling, and 
Zinf andel . 

32. A transgenic Vitis scion cultivar or rootstock 
cultivar according to claim 30, wherein the Vitis scion or 
rootstock cultivar is a rootstock cultivar selected from the 
group consisting of Couderc 1202, Couderc 1613, Couderc 1616, 
Couderc 3309, Dog Ridge, Foex 33 EM, Freedom, Ganzin 1 (A x R 
#1), Harmony, Kober 5BB, LN33, Millardet & de Grasset 41B, 
Millardet fc de Grasset 420A, Millardet & de Grasset 101-14, 
Oppenheim 4 (S04) , Paulsen 775, Paulsen 1045, Paulsen 1103, 
Richter 99, Richter 110, Riparia Gloire, Ruggeri 225, Saint- 
George, Salt Creek, Teleki 5A, Vitis rupestris Constantia, 
Vitis California, and Vitis girdiana. 
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33. A method of imparting grapevine leafroll virus 
resistance to a Vitis scion cultivar or rootstock cultivar 
comprising: 

transforming a Vitis scion cultivar or rootstock cultivar 
with a an expression system according to claim 26 wherein said 
vector is a plant transformation vector. 

34. A method according to claim 33, wherein the Vitis 
scion cultivar or rootstock cultivar is a scion cultivar 
selected from the group consisting of Vitis vinifra, Alden, 
Almeria, Anab-E-Shahi, Autumn Black, Beauty Seedless, Black 
Corinth, Black Damascus. Black Malvoisie, Black Prince, 
Blackrose, Bronx Seedless, Burgrave, Calmeria, Campbell Early, 
Canner, Cardinal, Catawba, Christmas, Concord, Dattier, 
Delight, Diamond, Dizmar, Duchess, Early Muscat, Emerald 
Seedless, Emperor, Exotic, Ferdinand de Lesseps, Fiesta, Flame 
seedless, Flame Tokay, Gasconade, Gold, Himrod, Hunisa, 
Hussiene, Isabella, Italia, July Muscat. Khandahar, Katta, 
Kourgane, Kishmishi, Loose Perlette, Malaga, Monukka, Muscat 
of Alexandria, Muscat Flame, Muscat Hamburg, New York Muscat, 
Niabell, Niagara, Olivette blanche, Ontario, Pierce, Queen, 
Red Malaga, Ribier, Rish Baba, Romulus, Ruby Seedless, 
Schuyler, Seneca, Suavis (IP 365), Thompson seedless, 
Aleatico, Alicante Bouschet, Aligote, Alvarelhao, Aramon, Baco 
blanc (22A), Burger, Cabernet franc, Cabernet, Sauvignon, 
Calzin, Carignane, Charbono, Chardonnay, Chasselas dore, 
Chenin blanc, Clairette blanche, Early Burgundy, Emerald 
Riesling, Feher Szagos, Fernao Pires, Flora, French Colombard, 
Fresia, Furmint, Camay, Gewurztraminer , Grand noir, Gray 
Riesling, Green Hungarian, Green Veltliner, Grenache, Grillo, 
Helena, Inzolia, Lagrein. Lambrusco de Salaraino, Malbec, 
Malvasia bianca, Mataro, Melon, Merlot, Meunier, Mission, 
Montua de Pilas, Muscadelle du Bordelais, Muscat blanc, Muscat 
Ottonel, Muscat Saint -Vallier, Nebbiolo, Nebbiolo fine, 
Nebbiolo Lampia, Orange Muscat, Palomino, Pedro Ximenes, Petit 
Bouschet, Petite Sirah, Peverella, Pinot noir, Pinot Saint- 
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George, Primitivo di Gioa, Red Veltliner, Refosco, Rkatsiteli, 
Royalty, Rubired, Ruby Cabernet, Saint-Emilion, Saint Macaire, 
Salvador, Sangiovese, Sauvignon blanc, Sauvignon gris, 
Sauvignon vert, Scarlet, Seibel 5279, Seibel 9110, Seibel 
13053, Semillon, Servant, Shiraz, Souzao, Sultana Crimson, 
Sylvaner, Tannat, Teroldico, Tinta Madeira, Tinto cao, 
Touriga, Traminer, Trebbiano Toscano, Trousseau, Valdepenas, 
Viognier, Walschriesling, White Riesling, and Zinfandel. 

35. A method according to claim 33, wherein the Vitis 
scion cultivar or rootstock cultivar is a rootstock cultivar 
selected from the group consisting of Couderc 1202, Couderc 
1613, Couderc 1616, Couderc 3309, Dog Ridge, Foex 33 EM, 
Freedom, Ganzin 1 {A x R #1) , Harmony, Kober 5BB, LN33, 
Millardet & de Grasset 41B, Millardet & de Grasset 420A, 
Millardet & de Grasset 101-14, Oppenheim 4 (S04) , Paulsen 775, 
Paulsen 1045, Paulsen 1103, Richter 99, Richter 110, Riparia 
Gloire, Ruggeri 225, Saint -George, Salt Creek, Teleki 5A, 
Vitis rupestris Constantia, Vitis California, and Vitis 
girdiana. 

36. A method according to claim 33, wherein the 
grapevine leafroll virus is selected from the group consisting 
of GLRaV-1 , GLRaV-2, GLRaV-3, GLRaV-4 , GLRaV-5, and GLRaV-6. 

37. A method according to claim 33, wherein said 
transforming is AgroJbacteriu/n-mediated. 

38. A method according to claim 37, wherein said 
transforming comprises: 

contacting tissue of the Vitis scion cultivar or 
rootstock cultivar with an inoculum of bacterium of the genus 
Agrobacteriujn transformed with an expression construct 
according to claims 26 and regenerating a transformed plant. 
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39. A method according to claim 38, wherein the tissue 
is selected from the group consisting of leaf tissue, root 
tissue, meristems, zygotic and somatic embryos, and anthers. 

40. A method according to claim 33, wherein said 
transforming comprises: 

propelling particles at grape plant cells under 
conditions effective for the particles to penetrate into the 
cell interior and 

introducing an expression vector comprising the DNA 
molecule into the cell interior. 

41. A method according to claim 40, wherein the vector 
is associated with the particles, whereby the vector is 
carried into the cell interior together with the particles. 

42. A method according to claim 41, wherein the vector 
surrounds the cell and is drawn into the cell by the 
particle's wake. 

43. A transgenic citrus scion cultivar or rootstock 
cultivar transformed with an expression system according to 
claim 26, wherein said vector is a plant transformation 
vector. 

44. A transgenic citrus scion cultivar or rootstock 
cultivar according to claim 43, wherein the citrus scion 
cultivar or rootstock cultivar is selected from the group 
consisting of lemon, lime, orange, grapefruit, pineapple, and 
tangerine . 

45. A transgenic citrus scion cultivar or rootstock 
cultivar according to claim 43, wherein the citrus scion 
cultivar or rootstock cultivar is selected from the group 
consisting of Joppa, Maltaise Ovale, Parson (Parson Brown), 
Pera, Pineapple, Queen, Shamouti, Valencia, Tenerife, Imperial 
Doblefina, Washington Sanguine, Moro, Sanguinello Moscato, 
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Spanish Sanguinelli, Tarocco, Atwood, Australian, Bahia, 
Baiana, Cram, Dalmau, Eddy, Fisher, Frost Washington, 
Gillette, LengNavelina, Washington, Satsuma Mandarin, Dancy, 
Robinson, Ponkan, Duncan, Marsh, Pink Marsh, Ruby Red, Red 
Seedless, Smooth Seville, Orlando Tangelo, Eureka, Lisbon, 
Meyer Lemon 1 , Rough Lemon, Sour Orange, Persian Lime, West 
Indian Lime, Bearss, Sweet Lime, Troyer Citrange, and Citrus 
trifoliata. 

46. A method for producing a virus-resistant citrus 
scion cultivar or rootstock cultivar comprising: 

transforming citrus tissue with an expression vector 
according to claim 26. 

47. A method according to claim 46, wherein said 
transforming is Agrobacteri urn-mediated. 

48. A method according to claim 46, wherein the tissue 
is selected from the group consisting of leaf tissue, root 
tissue, meristems, zygotic and somatic embryos, and anthers. 

49. The method of claim 46 wherein said expression 
vector is introduced into plant cells by a biolistic method. 

50. An antibody or binding portion thereof or probe 
recognizing the protein or polypeptide encoded by the DNA 
molecule according to any of claims 1 to 25. 

51. An isolated protein or polypeptide corresponding to 
a protein or polypeptide of a grapevine leafroll virus encoded 
by a DNA molecule according to any of claims 1-25. 

52. A method for detection of grapevine leafroll virus 
in a sample, said method comprising: 

providing an antibody or binding portion thereof 
recognizing the encoded protein or polypeptide according to 
any of claims 1 to 25; 
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contacting the sample with the antibody or binding 

portion thereof; and 

detecting any reaction which indicates that grapevine 
leaf roll virus is present in the sample using an assay system. 



53. A method according to claim 55, wherein the assay 
system is selected from the group consisting of an enzyme- 
linked immunosorbent assay, a radioimmunoassay, a gel 
diffusion precipitin reaction assay, an immunodiffusion assay, 
an agglutination assay, a fluorescent immunoassay, a protein A 
immunoassay, and an Immunoelectrophoresis assay. 

54. A method for detection of grapevine leaf roll virus 
in a sample, said method comprising: 

providing a nucleotide sequence of the DNA molecule 
according to any of claims 1 to 25 as a probe in a nucleic 
acid hybridization assay; 

contacting the sample with the probe; and 
detecting any reaction which indicates that grapevine 
leaf roll virus is present in the sample. 

55. A method for detection of grapevine leaf roll virus 
in a sample : 

providing a nucleotide sequence of the DNA molecule 
according to any of claims 1 to 25 as a probe in a gene 
amplification detection procedure; 

contacting the sample with the probe; and 
detecting any reaction which indicates that grapevine 
leaf roll virus is present in the sample. 
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FURTHER INFORMATION CONTINUED FROM PCT/ISA£10 



1. claims 3-5,(1.2.26-55 ail partially) 

- An isolated DNA molecule of a grapevine leafroll-associated virus III comprising a 
nucleotide sequence corresponding to SEQ ID NO:1 and encoding a protein or polypeptide 
with a sequence corresponding to SEQ ID NO:2. the protein being a viral heticase. 

- Expression system comprising said DNA-molecule. as well as host cells transformed with 
said expression system. 

- Transgenic vitis scion cultivars or root stock cultivars comprising said DNA molecule, 
methods for the transformation of vitis. as well as methods of imparting resistance to 
Grapevine leafroil virus using said DNA molecule. 

- Methods for the transformation of citrus and methods for producing virus-resistant atrus 
cultivars using said expression system, as well as the transgenic citrus scion or rootstocK 
cultivars produced thereby. ^^m^ ~* 

- An isolated protein or polypeptide encoded by said DNA molecule as well as antibodies or 
binding portions thereof recognizing said protein or polypeptide. 

- Methods for the detection of grapevine leafroil virus in a sample using said antiooaies or 
said DNA molecule or parts thereof. 



2 . claims 6-8. (1 .2.26-55 all partially) 
idem for Seq ID NO:3 and 4 

3 . claims 9-11. (1.2.26-55 ail partially) 
idem for Seq ID NO:5 and 6 

4 . claims 12-14, (1 .2.26-55 all partially) 
idem for Seq ID NO:7 and 8 

5 . claims 15-17, (1,2.26-55 all partially) 
idem for Seq ID NO:9 and 10 

6 . claims 18.19, (1.2.26-55 ail .partially) 
idem for Seq ID NO:11 and 12 

7 . claims 20.21 . (1 .2.26-55 all partially) 
idem for Seq ID NO:13 and 14 
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8 . claims 22.23. (1 .2.26-55 all partially) 
idem for Seq ID NO: 15 and 16 

9 . claims 24.25. (1 .2.26-55 all partially) 
idem for Seq ID NO:17 and 18 

10. claims (1.26-55 all partially) 

idem for isolated DNA molecules of grapevine ieafroll virus not included in the previous 
solutions 



